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METHOD FOR EXPRESSING COLOR PICTURE AND METHOD FOR SEARCHING COLOR 
PICTURE 



PUB. NO. : 
PUBLISHED: 
INVENTOR (s) : 
APPLICANT (s) 
APPL. NO. : 
FILED: 
PRIORITY: 

INTL CLASS: 



2000-348179 [JP 2000348179 A] 
December 15, 2000 (20001215) 
CIEPLINSKI LESZEK 
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2000-125633 [JP 2000125633] 
April 26, 2000 (20000426) 

9909961 [GB 999961], GB (United Kingdom), April 29, 1999 
(19990429) 

G06T-007/00; G06F-017/30 ; G06T-001/00 



ABSTRACT 

PROBLEM TO BE SOLVED: To provide a method and a device which accurately and 
flexibly express and retrieve pictures with a high storage efficiency 
and a high calculation efficiency . 

SOLUTION: The method for expressing a color picture includes a step where 
the area of a picture is selected, a step where one or a plurality of 
colors are selected as the representative colors of the area, and a step 
where at least two parameters related to the color distribution of each of 
representative colors are calculated for an area having the plurality of 
representative colors and are used to derive the descriptor of the picture 
area. 



COPYRIGHT: (C) 2000, JPO 
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SIMILAR NAME RETRIEVAL SYSTEM AND RECORD MEDIUM 
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December 18, 1998 (19981218) 
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, JP (Japan) 
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Japanese Company or Corporation) 



Computer Applications) 



ABSTRACT 

PROBLEM TO BE SOLVED: To retrieve a similar character string without 
dispersion from a data base without a similarity dictionary by 

providing a means for retrieving the data base based on a character 
string after being abbreviated and finding the similar character string. 

SOLUTION: A character string abbreviation means 3 abbreviates inputted 
character strings to less character strings by referring to a conversion 
table 4. Then, based on the character string after being abbreviated, the 
data base 5 is retrieved and a pseudo character string is found. At the 
time, the continuous vowels of the object of abbreviation included in the 
character string and the vowel after the abbreviation are registered to the 
conversion table 4 in correspondence. Also, for the character strings of 
the data base 5, specified consonants are eliminated and they are 

abbreviated to the less character strings by referring to the conversion 
table 4 and registered in correspondence. Thus, by eliminating the 
specified consonants from the input character string, converting the 



continuous vowels by referring to the conversion table 4, abbreviating the 
character string, then retrieving the data base 5 and finding the 
similar character string , retrieval is quickly performed . 
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RETRIEVAL DOCUMENT PREPARING DEVICE, RETRIEVAL DOCUMENT STORAGE MEDIA, 
DEVICE AND METHOD FOR RETRIEVING DOCUMENT 
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ABSTRACT 

PROBLEM TO BE SOLVED: To improve selectivity at the time of retrieval while 
reducing the burden of a user by storing words contained in document data 
correspondently to the relation of their positions by storing inputted 
document data, analyzing these data, segmenting them into words and storing 
the segmented words correspondently to the relation of positions of these 
words . 



SOLUTION: The document data are stored in an external storage device 3 as a 
retrieval data base for each document, morpheme analysis is performed, 
the words contained in these data and its appearance frequency are 
investigated and extracted for each document, and a retrieval word data 

base is prepared and stored in an external storage device 3. When a user 
inputs a retrieval keyword from a user input device 6, a retrieval part 
prepares a retrieval expression and retrieves the docioment data by 
using that expression . Retrieved result data are successively 

analyzed by an analytic part. The analytic part judges the 
presence/absence of document data to be referred to in the retrieval data 
base and when there is no data, , processing is finished. When there are 
such data, the presence/absence of the word corresponding to the retrieval 
keyword among wrods appearing in the document data is judged and on the 
stage when the correspondent word is completely referred to, the degree of 
importance is calculated. 
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METHOD AND DEVICE FOR EVALUATING INFORMATION RETRIEVING METHOD 
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ABSTRACT 



PURPOSE: To provide the method and device for information retrieving method 
evaluation which can securely judge an optimum information retrieving 
method. 

CONSTITUTION: The number of external evaluated values is counted, and when 
an judgement of an information retrieving method for one data is the same 
as on the side of a larger number of judgements by the given external 
evaluated values, a constant value generated by a constant numerical value 
generation module 4 is regarded as one data evaluated value. When the 
judgement is the same as on the side of a smaller number of judgements, the 
value obtained by a penalty calculation module 5 is regarded as one data 
evaluated value and one data evaluated value is generated as to a 
combination of all the given retrieval object data and data in a data 

base ; and a mean penalty calculation module 6 calculates the mean 
value of all the data evaluated values and retrieval efficiency is 
represented as an evaluated value of the retrieval efficiency of the 
information retrieving method by using one index. Evaluated values 
calculated for respective information retrieving methods are put in 
order to decide whether various information retrieving methods are superior 
or inferior. 
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45.3 



ABSTRACT 

PURPOSE: To enable efficiently recognizing a character string in an 

information retrieving device or an all-sentence data base 

-retrieving device and a keyword extracting device, etc., by finding the 
similarities of a character string and a character string for comparing 
with the string. 

CONSTITUTION: The character string recognizing method performs a character 
coincidence recognition 204 recognizing the correspondence of the 
coincidence of a character and a character one to one for recognizing the 
sequence of characters, a character similarity degree evaluation 
calculation 205 calculating the similarity degree of characters, a 
character coincidence sequence evaluation calculation 207 performing the 
evaluation calculation of the sequence of the matched characters and a 
character coincidence location evaluation calculation 208 performing the 
evaluation calculation by the location and place of the matched characters, 
in order to reflect the sequence of the character string and the location 
of the character in the character string as feature amount, and reflects 
all the feature amount of the character string length, the number of 
character coincidence, the sequence of the matched characters, and the 
location and the weighing of the matched characters between the character 
strings in place of a character string similarity degree evaluation 
calculation 205 for which the conventional character string length and the 
number of character coincidence are reflected as the feature amount 
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Section: P, Section No. 1606, Vol. 
September 03, 1993 (19930903) 



17, No. 488, Pg. 162, 



ABSTRACT 

PURPOSE: To attain a position detection by a sub-pixel efficiency beyond 
a template matching by a picture element unit by a rather simple 

calculation , without necessitating storing a correlation value pattern. 

CONSTITUTION: The correlation values of a picture to be detected and a 
template picture are calculated while a picture position is shifted by the 
picture element unit, and the picture position where the correlation value 
is the maximum is retrieved (2) . Then, the positions and correlation values 
of the neighboring 8 picture elements are searched (3) . Then, a 
multivariable and polynomial regression curved surface is decided from the 

searched position and correlation value of each picture element (4). 
Then, a position where the maximum correlation is applied is searched 

from the above mentioned searched multivariable and polynomial 
regression curved surface (5) . The position is searched by an efficiency 
which is more detail than the picture element. 
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WPI Acc No: 2001-464210/200150 
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Performance measurement method of object oriented application 
program, involves obtaining average response time for application 
program by dividing sum of measured elapsed time by number of user input 
functions 

Patent Assignee: UNISYS CORP (BURS ) 
Inventor: PURAM C K 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6269479 Bl 20010731 US 98203201 A 19981130 200150 B 

Priority Applications (No Type Date) : US 98203201 A 19981130 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 6269479 Bl 4 G06F-009/45 

Abstract (Basic): US 6269479 Bl 

NOVELTY - An elapsed time between the initiation of several user 
input function and the response provided by the application program to 
the input function, is measured and the measured values are added. An 
average response time representing the performance of the application 



program for the application program, is obtained by dividing the sum of 
measured elapsed time by the number of user input functions. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
computer readable medium storing the performance measurement 
instruction of object oriented program. 

USE - For measuring the performance of the object oriented 
application program run on computer. 

ADVANTAGE - By normalizing the average response time with respect 
to database size, more accurate and effective comparison of 
application programs that operate upon different size databases is 
achieved . 

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart of 
method for measuring the performance of object oriented application 
program. 

pp; 4 DwgNo 1/1 
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Information retrieval performance evaluation method involves 

extracting search objective information with respect to input search 

demand to judge search process 
Patent Assignee: SEIKO EPSON CORP (SHIH ) 
Inventor: KAYAHARA N 

Number of Countries: 002 Number of Patents: 002 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 2000276470 A 20001006 JP 9977896 A 19990323 200065 B 

US 6615200 Bl 20030902 US 2000533100 A 20000322 200359 

Priority Applications (No Type Date) : JP 9977896 A 19990323 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 2000276470 A 9 G06F-017/30 

US 6615200 81 G06F-017/30 

Abstract (Basic): JP 2000276470 A 

NOVELTY - A title corresponding to a text and the text are treated 
as a pair of matched documents so that either a title or a test is 
searched by inputting its complementary pair. A search performance 
is judged based on a search result obtained with respect to the input 
demand . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) information retrieval performance evaluation apparatus; 

(b) recording medium for storing information retrieval performance 
evaluation process program 

USE - In information retrieval performance evaluation system. 

ADVANTAGE - Enables to maintain search capability at constant 
level. Enables to carry out appropriate search process. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
search performance evaluation system. 

pp; 9 DwgNo 1/6 
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SYSTEM AND METHOD FOR PROCESSING INFORMATION 
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45.1 (INFORMATION PROCESSING — Arithmetic Sequence Units) 



ABSTRACT 

PROBLEM TO BE SOLVED: To improve a job efficiency of a user by determining 
a task to be next executed based on 1st information , finding the 
candidate of solution of the task by performing prescribed search 
processing based on 2nd information previously determined by that 
determined task, and performing prescribed post- processing based on that 
candidate of the solution. 

SOLUTION: The task to be first executed is determined and when the 
conditions of execution are turned to any prescribed state, a task system 
determining part 12 determines the task to be next executed based on the 
1st information determined in advance. Concerning that determined task, 
prescribed search processing is performed based on the predetermined 2nd 
information, and the candidate of solution of the task is found by an 
individual task processing determination part 14 corresponding to a search 
processing means. Based on that found candidate of the solution, the 
prescribed post-processing is performed by an individual task processing 
execution part 15 corresponding to a post- processing means. Thus, the job 
efficiency or result of the user can be improved 
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Aggregation set designing method for online analytical processing, 
involves adjusting benefit/cost rating of aggregation that varies between 
candidate and selected sets of respective aggregation maintained 

Patent Assignee: MICROSOFT CORP (MICT ) 

Inventor: NETZ A 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6366905 Bl 20020402 US 99338212 A 19990622 200246 B 

Priority Applications (No Type Date) : US 99338212 A 19990622 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 6366905 Bl 15 G06F-017/30 

Abstract (Basic) : US 6366905 Bl 

NOVELTY - The benefit/cost rating for an aggregation determined as 
a function of performance improvement and computer resources cost 
associated with the aggregation. A candidate set and a selected set of 
candidate aggregation and selected aggregation are maintained 
respectively. The benefit/cost rating for possible aggregation is 
adjusted moves in response to variation of aggregation between the 



candidate and the selected sets. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the 
following: 

(a) Computer-readable medium storing aggregation set designing 
program; and 

(b) Computer arrangement for designing set of aggregations. 
USE - For designing aggregation set for online analytical 

processing (OLAP) of data warehouse and business analysis system, in 
enterprise wide computer networks, intranets and internet. 

ADVANTAGE - The aggregation are computed and materialized even 
before a query is issued so that the data answering the query are 
presented to the user quickly. The aggregation eliminates the need to 
retrieve and manipulate more detailed data to answer the qpiery . 
Efficient use of computing resources is provided. The aggregation 
are selected quickly and efficiently even in OLAP cubes having many 
dimensions and levels. A balance is achieved between cost and 
performance of an aggregation that is best suited for a particular 
application. Because the size of an aggregation is independent of 
external variables such as processor speed, aggregation size is a 
particularly advantageous cost metric. 

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart 
depicting a process for updating benefit/cost rating in response to the 
removal of selected aggregation. 

pp; 15 DwgNo 5/5 
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Distributed search device feeds back evaluation information acquired from 

evaluation program to information retrieval unit 
Patent Assignee: FUJITSU LTD (FUIT ) 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 2002063164 A 20020228 JP 2000250046 A 20000821 200232 B 

Priority Applications (No Type Date) : JP 2000250046 A 20000821 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 2002063164 A 15 G06F-017/30 

Abstract (Basic) : JP 2002063164 A 

NOVELTY - An information retrieval unit searches the information 
about a person's search information. A program provision unit provides 
an evaluation program to the client. The evaluation information is 
acquired from the evaluation program. A feedback processor (136) feeds 
back the evaluation information to the information retrieval unit. 

USE - For distributed search system. 

ADVANTAGE - Effective evaluation information can be obtained 
easily. Effectively finds the information resource which satisfies 
user's desire based on effective evaluation information by search 
and sort . 

DESCRIPTION OF DRAWING (S) - The figure shows the hardware block 
diagram of the distributed search device. (Drawing includes non-English 
language text) . 

Feedback processor (136) 
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Abstract (Basic) : JP 2001067369 A 

NOVELTY - A data conversion unit converts the attribute values in 
several information sources into preset format. Index generation unit 
generates index to the value output by data conversion unit. Search 
type evaluation unit (711) performs data search pertaining to enquiry 
using the generated index. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(a) Information retrieval procedure; 

(b) Information retrieval program 

USE - Information retrieval system for retrieving data stored in 
data warehouse and data mart . 

ADVANTAGE - Since index value is used for searching, search process 
is performed quickly. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
information retrieval system. (Drawing includes non-English language 
text) . 

Search type evaluation unit (711) 
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Abstract (Basic) : JP 2000305945 A 

NOVELTY - The display position of each information is determined 
and displayed to mutually show the search result information belonging 
to each group. Each search result information is compared to each other 
to obtain the similarity between each search result information. Each 
information is compared with various search conditions stored in an 
information memory {103a) . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) an information retrieval apparatus; 

(b) and a memory medium. 
USE - For computer. 

ADVANTAGE - Does not require evaluation of each information in 
group. Increases efficiency when evaluating search result. 
Ensures smooth control of information retrieval apparatus. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
an information retrieval apparatus. 

Information memory (103a) 
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involves grouping results by categories of features of underlying 
information in user selected results, for display 
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Abstract (Basic) : US 6028605 A 

NOVELTY - Based on meta data and user query, panels are displayed 
in results window. The user is enabled to page via panels, using 
virtual buttons. A query is performed by conducting semantic 
analysis on meta data representing semantic model of information on 
retrieval system. The features of underlying information is 
extracted from user selected results and results grouped by categories 
of features is displayed. 

DETAILED DESCRIPTION - The information categories are displayed for 
each user according to each users pattern of interaction with GUI. An 



' INDEPENDENT CLAIM is also included for a program storage device. 

USE - For searching information in Internet, using GUI. 

ADVANTAGE - Searches information efficiently without requiring that 
a user be an expert in search technology. Reports the results of search 
in organized fashion and presents the information by categories and 
relations. Enables user to extract information from search results and 
use them to refine or create subsequent searches. 

DESCRIPTION OF DRAWING (S) - The figure shows screen display of meta 
data search results window. 
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Information retrieval apparatus for searching specific information , 
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conditions 
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Abstract (Basic) : JP 2000181923 A 

NOVELTY - The apparatus has an input unit (101) to receive search 
condition for retrieving. The degree of importance of each word in the 
search conditions is calculated by performing syntactic analysis of the 
search conditions by importance degree calculation unit (103) . The 
degree of concurrence of search and the search condition are calculated 
by a calculation unit (104) and the search result is output. 

USE - For information retrieval from larger volumes of information. 

ADVANTAGE - Enables easy and faster information retrieval. 
DESCRIPTION OF DRAWING (S) - The figure shows the functional 
components of the information retrieval apparatus. 
Input unit (101) 
Calculation units (103,104) 
pp; 17 DwgNo 1/16 
Title Terms: INFORMATION; RETRIEVAL; APPARATUS; SEARCH; SPECIFIC; 

INFORMATION; RETRIEVAL; DATA; PERFORMANCE; SYNTACTIC; ANALYSE; INPUT; 
SEARCH; CONDITION; CALCULATE; DEGREE; CONCURRENT; SEARCH; SEARCH; 
CONDITION 
Derwent Class: TOl 

International Patent Class (Main) : G06F-017/30 
File Segment: EPI 



File 348: EUROPEAN PATENTS 1978-2003/Oct WOl 

(c) 2003 European Patent Office 
File 349: PCT FULLTEXT 197 9-2002/UB==20031009, UT-20031002 

(c) 2003 WIPO/Univentio 

Set Items Description 

51 233132 (PERFORM? OR PROFICIEN? OR EFFECTIV? OR EFFICIEN? OR SUCCE- 

SS?) (5N) (RULE? ? OR TEMPLATE? ? OR STEIATEG? OR FILTER? ? OR P- 
LAN OR PLANS OR POLICY OR POLICIES OR PROFILE? ? OR METHOD? OR 
EXPRESSION? ? OR STATEMENT? ? OR PHRASE? ? OR STRING? ?) 

52 15093 (PERFORM? OR PROFICIEN? OR EFFECTIV? OR EFFICIEN? OR SUCCE- 

SS?) (5N) (SEARCH OR QUERY OR QUERIES) 

53 188068 (MATCH??? OR LOCATE? ? OR LOCATING OR FIND??? OR RETRIEV? - 

OR OBTAIN?) (5N) (RECORD? ? OR DOCUMENT? ? OR FILE? ? OR PAGE? ? 
OR WEBPAGE? ? OR SITE? ? OR WEBSITE? ? OR HIT? ? OR URL? ? OR 
RESOURCE 0 LOCATOR? ? OR OBJECT? ? OR DATA) 

54 146830 (MATCH??? OR LOCATE? ? OR LOCATING OR FIND??? OR RETRIEV? - 

OR OBTAIN?) (5N) (IMAGE? ? OR PICTURE? ? OR PHOTO? ? OR PHOTOGR- 
APH? ? OR CLIP? ? OR INFORMATION OR ARTICLE? ?) 

55 1190559 DATABASE? ? OR DATA()BASE? ? OR REPOSITOR??? OR SEARCH??? - 

OR QUERY??? OR QUERIE? ? 

56 670 SI (5N)S3:S4 (5N) (MEASUR? OR ASSESS? OR EVALUAT? OR ANALYZ? - 

OR ANALYS? OR CHECK??? OR GAUG??? OR DETERMIN? OR STATISTIC? - 
OR ESTIMAT??? OR TEST??? OR CALCULAT? OR ASCERTAIN? OR FIND? - 
OR COMPUTE OR COMPUTES OR COMPUTED OR COMPUTING) 

57 66 S6(S)S5 AND IC=G06F 

58 345 S2 (5N) S3:S4 (5N) (MEASUR? OR ASSESS? OR EVALUAT? OR ANALYZ? - 

OR ANALYS? OR CHECK??? OR GAUG??? OR DETERMIN? OR STATISTIC? - 
OR ESTIMAT??? OR TEST??? OR CALCULAT? OR ASCERTAIN? OR FIND? - 
OR COMPUTE OR COMPUTES OR COMPUTED OR COMPUTING) 

59 229 S8 AND IC=G06F 

510 24 S9/TI,AB,CM 

511 22 SIO NOT S7 

512 76967 (MATCH??? OR LOCATE? ? OR LOCATING OR FIND???) {5N) (RECORD? 

? OR DOCUMENT? ? OR FILE? ? OR PAGE? ? OR WEBPAGE? ? OR SITE? 
? OR WEBSITE? ? OR HIT? ? OR URL? ? OR RESOURCE () LOCATOR? ? OR 
OBJECT? ? OR DATA) 

513 43909 (MATCH??? OR LOCATE? ? OR LOCATING OR FIND???) (5N) (IMAGE? ? 

OR PICTURE? ? OR PHOTO? ? OR PHOTOGRAPH? ? OR CLIP? ? OR INF- 
ORMATION OR ARTICLE? ?) 



S14 


794 


SI (10N)S12:S13 


S15 


210 


S14 (S)S5 


S16 


117 


S15 AND IC=G06F 


CI '7 


81 


816 NOT (S7 OR Sll) 


S18 


585 


S2 (10N)S12:S13 


S19 


369 


S18 AND IC=G06F 


S20 


43 


S19/TI,AB,CM 


S21 


17 . 


820 NOT (S7 OR Sll OR S17 



(Item 4 from file: 348) 

DIALOG (R) File 34 8: EUROPEAN PATENTS 

(c) 2003 European Patent Office. All rts. reserv. 

01396405 

Document retrieval and classification method and apparatus 

Verfahren und Einrichtung zur Wiederauf f indung und Klassif izerung von 
Dokumenten 

Procede et appareil pour la recherche et classification de documents 

PATENT ASSIGNEE: 

Matsushita Electric Industrial Co., Ltd., (1855508), 1006, Oaza-Kadoma, 
Kadoma-shi, Osaka 571-8501, ( JP) , (Applicant designated States: all) 
INVENTOR: 

Noguchi, Naohiko, 1323-601, Futoo-cho, Kouhoku-ku, Yokohama 222-0031, 
(JP) 

Kanno, Yuji, 1-1-19, 0-oka, Minaitii-ku, Yokohama 232-0061, (JP) 
Sato, Mitsuhiro, 4-11-11, Miyanosato, Atsugi-shi, Kanagawa-ken 243-0216, 
(JP) 

Ito, Hayashi, 177-1-302, Nishimabashi, Aikawa-cho, Matsudo-shi, Chiba-ken 
271-0045, (JP) 

Fukushige, Yoshio, 3-4-2-203, Daigiri, Fuj isawa-shi , Kanagawa-ken 
251-0002, (JP) 

Inaba, Mitsuaki, 2-20-5-628, Higashi' Rokugou, Ota-ku, Tokyo 144-0046, 
(JP) 

LEGAL REPRESENTATIVE: 

Dempster, Benjamin John Naftel et al (62251), Withers & Rogers, Goldings 
House, 2 Hays Lane, London SEl 2HW, (GB) 
PATENT (CC, No, Kind, Date) : EP 1182580 Al 020227 (Basic) 
APPLICATION (CC, No, Date) : EP 2000307256 000823; 
DESIGNATED STATES: DE; FR; GB 

EXTENDED DESIGNATED STATES: AL; LT; LV; MK; RO; SI 
INTERNATIONAL PATENT CLASS: G06F-017/30 

ABSTRACT EP 1182580 Al 

In a document retrieval and classification system, a retrieving 
operation is performed on a database of documents in accordance with 
retrieval conditions entered by a user so as to pick up retrieved 
documents as intended. The user is allowed to input classification 
standards of a plurality of classifications in response to the retrieved 
documents picked up by the retrieving operation. The classification 
standards are converted into retrieval conditions. The similarity between 
the converted retrieval conditions resultant from the classification 
standards and the retrieved documents picked up by the retrieving 
operation is calculated. And, an attribute of each retrieved document 
picked up by the retrieving operation to each classification is 
calculated with reference to the similarity, thereby classifying each 
retrieved document into a classification having a highest attribute. 

ABSTRACT WORD COUNT: 128 

NOTE: 

Figure number on first page: 1 

LEGAL STATUS (Type, Pub Date, Kind, Text): 
Application: 020227 Al Published application with search report 

Examination: 020918 Al Date of request for examination: 20020719 

LANGUAGE ( Publication, Procedural, Application) : English; English; English 

FULLTEXT AVAILABILITY: 

Available Text Language Update Word Count 

CLAIMS A (English) 200209 1313 

SPEC A (English) 200209 6813 
Total word count - document A 8126 
Total word count - document B 0 
Total word count - documents A + B 8126 

INTERNATIONAL PATENT CLASS: G06F-017/30 



.CLAIMS classification, 
keywords are extracted from the extracted documents, 

a set of the extracted keywords is converted into retrieval conditions. 



and 

the similarity between the converted retrieval conditions and the 
retrieved dociiments picked up by the retrieving operation is 
calculated . 

12. A dociament retrieval and classification method comprising the 
steps of: 

performing a retrieving operation on a database of documents in 
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.SPECIFICATION to find a target image and alternatively, 2) the total 
number of images examined until the target is found. 

With these metrics in mind, one test of the effectiveness of an 
image - retrieval method and system is the average difficulty (over 
many trials) a user experiences in finding a single, randomly selected 
target image from within a large database . Those skilled in the art can 
readily appreciate that several variations of this test are possible. For 
example, a target image can be displayed on a computer monitor beside an 
image-retrieval interface for the duration of a search . Or the target 



image may be displayed for only a short time interval before the search 
begins and then the user must search for the remembered image. 
Alternatively, the target image may be distorted, e.g. a hardcopy with 
color distortion, from the true image present in the database . This 
latter procedure may be useful to determine which distortions/features 
are important for image retrieval. The difficulty the user experiences in 
finding a target... 

...quantified in a variety of ways, i.e. the amount of time or number of 
user interactions required {i.e. mouse clicks, iterations of the search 
, keystrokes, etc.). 

Now consider a case where a user is searching for a specific datum in a 
database by means of a series of display. , , 
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example, a target image can be displayed on a computer monitor beside an 
image-retrieval interface for the duration of a search . Or the target 
image may be displayed for only a short time interval before the search 
begins and then the user must search for the remembered image. 
Alternatively, the target image may be distorted, e.g. a hardcopy with 
color distortion, from the true image present in the database . This 
latter procedure may be useful to determine which distortions/features 
are important for image retrieval. The difficulty the user experiences in 
finding a target... 

...quantified in a variety of ways, i.e. the amount of time or number of 
user interactions required (i.e. mouse clicks, iterations of the search 
, keystrokes, etc.). 

Now consider a case where a user is searching for a specific datum in a 
database by means of a series of display. . . 
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...SPECIFICATION string operation mutation checking and the string 

operator mutation itself are carried out with the LSE data structures, 
without retrieval of data values from the data base storage. After 
all string operators that can be mutated have been mutated, the 
assignment statement is ready for deferred evaluation , at which time 
the data values are retrieved from the data base storage and a 
conventional deferred evaluation scheme is performed on the mutated 
assignment statement . The LSE data structures are then deleted. Using 
LSE data structures for indirect processing of data further simplifies 
data base access operations. 
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SPECIFICATION Background of the Invention 

This invention relates to pattern matching, such as arises in complex 
data base systems, and more specifically, in rule-based artificial 
intelligence systems (i.e., so-called "expert systems") in which the 
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...CLAIMS assignable to the applicable diagnoses (AD) found in step d) , 
and storing said diagnosis associated relations in a further memory 
element (R) , 

f) comparing said diagnosis associated relations (R) with said 
qualitative data (2), in order to find out whether one or 
more of said necessary but not sufficient conditions are met, whereby 
such conditions are selected as search roots and are stored in a 
sixth memory element (6) within said second partition of said 
knowledge data base (KB) , 

g) in a third partition of said knowledge data base containing 
diagnosis context independent cause-effect relations between the 
computer installation variables, stored in an eighth memory element 
(8), searching one or more relations between said search roots 



(6) stored in step f) and said context independent cause-effect 
relations of the computer installation variables, and storing the 
found relations in an. . . 

...relations of step g) with the qualitative data (2), whereby the matched 
qualitative data is stored in a seventh memory element (7) within 
said knowledge data base (KB) , 
i) comparing said matched qualitative stored data (7) and the search 
roots (6) of step f) with a set of evidence relations between the... 
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stream is passed through the sequence of cells, each cell performing a 
logical operation based only on the data provided to it from the previous 
cell in the sequence. Each cell has a chracter register for data storage 
and a pattern register for storage of part of a search pattern. The 
contents of the two registers are compared in each cell, at each cycle of 
a clock used to propagate the data through the processor. Match 
indicators or match tolerance values are propagated through the processor 
on a match line, and match results emerge in synchronism with the data 
stream. Multiple match lines are employed in one preferred embodiment, to 
temporarily save, retrieve and exchange match tolerance values, in order 
to effect logically complex searches in a highly concurrent manner. Types 
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.SPECIFICATION Boolean algebra, is described in GB-A-2 156115 . This has a 
plurality of serially connected cells which hold a search pattern in 
pattern registers. A test stream is passed through the cells, and the 
presence or not of a match with the search pattern is indicated 
on a match line formed by serially connecting a delay element of each 
cell. At each cell there is logic for clearing the match line in the 
absence of a match between the search pattern and the test stream. 

However there is still a need for an improved special-purpose processor 
that can perform a variety of search functions... 
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. English Abstract 

Image analysis techniques may be provided. Location of objects in an 
image may be determined based on intensity characteristics of pixels in 
the image. Objects that have been located may be mapped to a source for 
the objects based for example, on a grid structure that may have been 
used to place the objects. Differential analysis of objects of two source 
materials in images may be determined based on aligned versions of the 
images. Filtering may be used to weigh pixel characteristics. Such object 
analysis techniques may have been encoded into a set of 

machine-executable instructions and stored on a machine-readable storage 
medium for use by equipment that is to perform the techniques. 

French Abstract 

Des techniques d' analyse d' images peuvent etre obtenues . L ' emplacement 
d'objets dans une image peut etre determine sur la base de 
caracteristiques d*intensite de pixels dans 1* image. Les objets ayant ete 
localises peuvent etre cartographies sur une source pour les objets 
bases, par exemple, sur une structure de grille pouvant avoir ete 
utilisee pour placer les objets. Une analyse dif f erentielle d'objets de 
deux matieres de sources dans des images peut etre determinee sur la base 
de versions alignees des images. Un filtrage peut etre utilise pour 
ponderer des caracteristiques de pixels. Ces techniques d' analyse 
d'objets peuvent avoir ete codees en un jeu d ' instructions executables en 
machine et stockees sur un support de stockage exploitable par machine et 
destine a etre utilise par un materiel executant les techniques. 
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19th month from priority date 
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Claims 

Claim 

... is centered on another one of the pixels. 

79 A machine-readable storage medium 

encoded with a set of machine executable instructions 
for using image analysis equipment to perform a method 
for automatically locating spots in an image based on 
intensities of pixels in the image, comprising: 

searching for a first pixel that has an 
intensity that is approximately above a threshold 
intensity; 

determining that a part of a new spot is 
located at the first pixel in response to said 
searching ; 

searching only nearest neighbors 
recursively starting from the first pixel to identify 
which ones of the pixels in the image are part of the 
new spot based on the threshold intensity; and 
repeating said searching for a first 

pixel, said determining, and said searching to identify 
a high density array of spots in the image. 8 

80 The machine-readable storage medium of 
claim 79 wherein said searching only. . . 

...pixels that are part of the new spot. 

84 A machine-readable storage medium 

encoded with a set of machine executable instructions 

for using image analysis equipment @ to perform a method 



for automatically locating spots in an image based on 
intensities of pixels in the image, comprising: 

searching serially for a first pixel 
that has an intensity that is approximately above a 
threshold intensity; 

determining that part of a new spot is 
located at the first pixel in response to said 
searching ; and 

searching only nearest neighbors 
recursively starting from the first pixel to identify 
which ones of the pixels in the image are part of the 
new spot... a plurality of regions 
in the composite image. 
119. A machine-readable storage medium 
encoded with a set of machine executable instructions 
for using image analysis equipment to perform a method 
for automatically determining where spots in an image 
are located based on intensities of a plurality of 
pixels in the image, comprising: 
determining a threshold intensity for a 
particular image region based on intensities of... said 
determining, whether one of the 
spots. 0 

125. A machine-readable storage medium 

encoded with a set of machine executable instructions 

for using image analysis equipment to perform a method 

for automatically locating spots in an image based on 

intensities of pixels in the image comprising: 

determining an integrated intensity for 

each of a plurality of regions in the image; 

sorting the . . . 

.is approximately within the 
threshold range. 

130. A method for automatically locating 

objects in an image based on intensities of pixels in 

the image, comprising: 

searching for a first pixel that has an 
intensity that is approximately above a threshold 
intensity; 

determining that a part of a new object 
is located at the first pixel in response to said 
searching ; and 

searching only nearest neighbors 
recursively starting from the first pixel to identify 
which ones of the pixels in the image are part of the 
new object . . . 
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English Abstract 

A system and method for indexing, searching and performing related 
operations on the Internet uses an automated browser (10) to find 
Internet web pages that are bookmark files (20) . These may be 
identified by being of a particular type of file, such as a "NETSCAPE" 
bookmark file, or as being a list of URL's found outside the server where 
the bookmark resides, or otherwise. The web pages (30) corresponding to 
the URL's contained within the bookmark files (20) are parsed for 
relevant information and a database is constructed based thereon. 
Another database (4 0) may be constructed to associated categories 
contained within found bookmark files with URL's contained within the 
categories. The databases are searched by a user who inputs a query 
, and relevant URL's and other information is displayed. 

French Abstract 

La presente invention concerne un systeme et un precede d'indexage, de 
recherche et de realisation . d ' operations sur 1' Internet, qui utilisent un 
navigateur automatise pour trouver des pages web sur 1' Internet, 
lesquelles sent des fichiers privilegies. Ces pages peuvent etre 
identifiees par le fait qu'elles sont d'un type particulier, tel qu'un 
fichier privilegie <=NETSCAPE>=, ou par le fait qu'elles sont une liste 
d'un URL trouvee en dehors du serveur ou sont localises les fichiers 
privilegies, ou encore d'une facon differente. Les pages web 
correspondant a 1 ' URL contenu dans les fichiers privilegies sont 
analysees en vue d'obtenir des informations pertinentes et on construit 
une base de donnees f ondee sur ces dernieres . On peut construire une 
autre base de donnees destinee a des categories associees contenues dans 
des fichiers privilegies trouves dont 1 ' URL est contenu dans ces 
categories. Un utilisateur recherche les bases de donnees en entrant une 
demande, et les URL pertinents et d'autres informations sont affiches. 

Legal Status (Type, Date, Text) 

Publication 20010215 A2 Without international search report and to be 
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Main International Patent Class: G06F-007/00 
English Abstract 

A system and method for indexing, searching and performing related 
operations on the Internet uses an automated browser (10) to find 
Internet web pages that are bookmark files (20) . These may be 
identified by being of a particular type of file, such as a "NETSCAPE" 
bookmark file, or as... 

...resides, or otherwise. The web pages (30) corresponding to the URL's 
contained within the bookmark files (20) are parsed for relevant 
information and a database is constructed based thereon. Another 
database (40) may be constructed to associated categories contained 
within found bookmark files with URL's contained within the categories. 
The databases are searched by a user who inputs a query , and 
relevant URL's and other information is displayed. 
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Detailed Description 
Claims 

Fulltext Word Count: 4501 
English Abstract 

The present invention relates to an image search method capable of 
expressing one color feature related to a spatial color feature of a 
still image based on a multilevel image grid and similarity-based 
searching images using the thusly expressed multilevel image grid. In the 
present invention, hierarchical grids of different levels are generated 
with respect to one feature for thereby obtaining a data structure in 
which each cell corresponding to the grid is expressed based on a 
reliability on a region representative color and the region 
representative color, so that it is possible to fast and accurately 
search images with respect to a user's content-based query based on a 
cell matching of the same level as two image grids and different levels 
or a color local matching of the grid match. 

French Abstract 

La presente invention concerne un precede de recherche d* image capable 
d'exprimer une caracteristique de couleur se rapportant a une 
caracteristique de couleur spatiale d'une image fixe s ' appuyant sur une 
grille d' image a plusieurs niveau et la recherche d' image a base de 
similitude par utilisation de la grille d* image multiniveau ainsi 
exprimee. En 1 ' occurrence, on genere des grilles hierarchisees de 
differents niveaux par rapport a une caracteristique de facon a obtenir 
une structure de donnees dans laquelle chaque cellule correspondant a la 
grille s'exprime en fonction d'une fiabilite sur une couleur 
representative d'une region et la couleur representative de la region, si 
bien qu'il est possible de rechercher des images, rapidement et avec 
precision, en fonction d'une requete utilisateur a base de contenu, en 
s ' appuyant sur une correspondance des cellules du meme niveau que deux 
grilles d' images et differents niveaux ou une correspondance locale des 
couleurs de la concordance en grille. 



Legal Status (Type, Date, Text) 

Publication 20000914 Al With international search report. 
Publication 20000914 Al In English translation (filed in Korean) . 
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Fulltext Availability: 
Detailed Description 

Detailed Description 

Namely, the color region matching operation between the image grid 
levels (Exact scale matching) of the same size is performed based on a 
method for searching a color region of the same levels from a target 
image. The position is matched with the relative position based on the 
same image grid. . . 

...of the color region is computed, and the position is matched with a 
translation position at the same level of the target image for thereby 
computing a similarity of the color region. 

The color region matching operation between the different image grid 
levels (Inter-scale matching ) is performed based on a method for 
searching the different level color regions among the target images, and 
a similarity of the color region of the same level is computed among the 
different . . . 
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English Abstract 

A method and apparatus is provided for searching a knowledge base -to 
determine whether a search object matches any of a plurality of knowledge 
base entries. When the knowledge base includes a partially matched entry, 
the group of partially matched entries (elements (3, 5, 7, 9, and 11) in 
the figure) is identified. Thereafter, at least one previously unselected 
search object bit is selected and examined to determine whether the group 
of partially matched entries includes a further matched entry the 
represents the same bit pattern in its bit that corresponds to the 
previously unselected search object bit and when it does not, the search 
is terminated indicating no match. 

French Abstract 

Cette invention concerne un precede et un appareil qui permettent 



d'effectuer une recherche dans una base de connaissances, et de 
determiner si I'objet de la recherche correspond a I'une ou 1' autre des 
entrees de la base de connaissances. Lorsque la base de connaissances 
comprend une entree correspondant partiellement , on identifie le groupe 
des entrees correspondant partiellement (Fig. 1, elements 3, 5, 7, 9, 
11). On selectionne ensuite au moins un bit d*un objet de recherche qui 
n'a pas ete selectionne jusqu*alors, et on 1 ' examine afin de determiner 
si le groupe d' entrees correspondant partiellement comprend ou non une 
autre entree correspondante qui possede le meme diagramme binaire dans 
son bit qui correspond au bit de I'objet de recherche non selectionne 
jusqu'alors. Si tel n'est pas le cas, la recherche se termine en 
indiquant 1' absence de correspondance . 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Detailed Description 

Detailed Description 

... the manner in which Don't Cares are implemented by the scattered 
memory access device. 

-13 

DETAILED DESCRIPTION 
The presel 

,@@invention is directed to a method and apparatus for performing 
searches , wherein each search determines a match or conflict 
between a search object and any of a plurality of entries in a 
knowledge base. The searching scheme of the present invention functions 
in many ways like a neural network and therefore, the manner in which it 
operates is described making reference... 
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English Abstract 



A method and apparatus for intelligent Internet searching (16) , the 
apparatus residing as a software application on a user computer (the 
client) (12) . A single subject database is stored on the client and 
accessed by the application. The majority of the single subject database 
content comprises a hierarchical listing of "hidden" web databases, all 
entries being organized by subject matter and each including a 
description of database content and search term entry interface 
customized for the particular database access page format. There are also 
entries in the database which provide an interface to search engines 
hosted at a dedicated search server. Each database is preferably updated 
at a regular interval, such as monthly or weekly, via remote download 
from a server on the WAN, or by other data transport means. A plurality 
of simultaneous hidden database searches may be performed by the 
application by linking the client to the appropriate database access 
pages on the network and forwarding the user desired search information. 
Search results are cached on the user computer for comparison to newly 
found search results, allowing for easy sorting of new and old data and 
differentiated display to the user. Desired keywords are preferably 
cached and shared among database search interfaces. 

French 7\bstract 

Cette invention a trait a un precede et a l^appareil correspondant de 
recherche intelligent dans 1' Internet (16), cet appareil residant comme 
application logicielle dans un ordinateur utilisateur (le client) (12 ) . 
Une base de donnees sujet unique est stockee dans 1* ordinateur client est 
1 ' application y accede. La majeur partie du contenu de cette base de 
donnees comporte un listage hierarchique de bases de donnees cachees du 
Web, toutes les entrees etant organisees par domaine et chacune 
renfermant une description du contenu de la base de donnee et une 
interface d' entree de terme de recherche personnaiisee pour le format de 
page d'acces de base de donnees particuliere . La base de donnees possede 
egalement des entrees qui assurent une interface avec les moteurs de 
recherche inclus dans un serveur de recherche specialise. Chaque base de 
donnee est, de preference, mise a jour a intervalles reguliers, 
mensuellement ou de facon hebdomadaire, par le biais d'un telechargement 
effectue par un serveur sur le reseau etendu ou par un autre moyen de 
transport de donnees. L ' application est a meme d'executer plusieurs 
recherches de bases de donnees cachees en reliant le client aux pages 
d'acces de base de donnees appropriees sur le reseau et en transmettant a 
1 ' utilisateur 1 * information de recherche desiree. Les resultats de la 
recherche sont places dans 1 ' antememoire de 1* ordinateur utilisateur pour 
comparaison avec les resultats de recherche dernierement trouves, ce qui 
facilite le tri des nouvelles et des anciennes donnees ainsi qu'un 
affichage differencie. Les mots-clefs desires sont, de preference, mis 
dans 1 ' antememoire et partages entre les interfaces de recherche de base 
de donnees. 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Detailed Description 

Detailed Description 

... Wide Web (the graphics-capable portion of the Internet), with 
uncounted millions of gigabytes of additional information stored in 
non-Web based, though Web accessible, databases . For the purpose of 
describing the present invention, information obtained through the Web, 
for example presented in Hyper Text Markup Language (IITUL) and available 
at . . . 

...a portal or doorway is found on the Web) is within the hidden or 

"invisible" web and is termed "indirectly accessible." While there are 
numerous search engines and "web crawlers" that may be used to search 
for directly available data on the visible web, there is presently no 
singular source for accessing the indirectly available information on the 
hidden web. The present invention addresses the need for an efficient 
method of finding data on a large scale WAN such as the Internet, 
including the visible and hidden portions of the World Wide Web, and the 
need to efficiently. . . 
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English Abstract 

A method and mechanism for responding to a query in a hierarchically 
organized system of documents and folders. In response to the query, a 
set of documents is retrieved based on specified criteria. Only documents 
in that set which match a specified scope are returned in a result set. 
Scope testing is performed on each of the documents in the set by 
obtaining a document identifier of each document, and then using that 
document identifier to obtain a document identifier of the parent folder 
thereof. The document identifier (80) of the parent folder is used as a 
key to a data structure, which stores flags indicative of whether parent 
folders are in the specified scope. If the flag for a given parent folder 
indicates that the parent folder is in scope, the document having that 
parent is returned in the result set. If the flag indicates that the 
current document is not in scope, that document is not returned. If there 
was not an entry in the data structure for that key, prefix matching is 
performed on the parent folder to determine whether it is in scope. The 
parent folder scope information is then added to the data structure as a 
flag indexed by the document identifier of the parent folder. 

French Abstract 

Precede et mecanisme permettant de repondre a une interrogation dans un 
systeme de documents et de dossiers organise hierarchiquement . En reponse 
a 1 ' interrogation, un ensemble de documents est extrait sur la base de 
criteres specifies. Seuls les documents se trouvant dans cet ensemble et 
correspondant a une cible specifiee sont retournes dans un ensemble 
resultat. Le controle de la cible est effectue sur chaque document de 
1* ensemble grace a I'obtention d'un identif icateur de documents de chaque 
document et ensuite, par utilisation de cet identif icateur de documents 
pour obtenir un identif icateur de documents du dossier parent de ce 
document. L ' identif icateur de documents du dossier parent est utilise 
comme une cle a une structure de donnees qui enregistre des drapeaux qui 
indiquent si les dossiers parents se trouvent dans la cible specifiee. Si 
le drapeau, pour un dossier parent donne, indique que le dossier donne se 
trouve dans la cible, le document ayant ce parent est renvoye dans le jeu 
resultat. Si le drapeau indique que le document actuel ne se situe pas 
dans la cible, ce document n*est pas renvoye. S*il n'existait pas 
d' entree dans la structure de donnees pour cette cle, la correspondance 
prefixee est effectuee sur le dossier parent en vue de determiner si 11 
se trouve dans la cible. Les informations relatives a la presence ou a 



1' absence du dossier parent dans la cible sont alors ajoutees a la 

structure de donnees sous forme d'un drapeau indexe par 1 ' identif icateur 

de documents du dossier parent. 
Main International Patent Class: G06F-017/30 
International Patent Class: G06F-15:40 
Fulltext Availability: 

Detailed Description 

Detailed Description 

query is being processed, the search 
engine searches the index to obtain the documents that 
match the specified restriction without respect to scope. 

To scope test , the search engine performs a string 
comparison, known as prefix matching , on each of the 

documents as they are retrieved to determine which, if any, 
of those files are within the specified scope. Located 
files that have prefixes corresponding to those in the 

query specification are said to be "in scope." Properties 
of those matching files which are in scope are then 
returned in the result set. 

However, string... 
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Claims 

Fulltext Word Count: 667 6 
English Abstract 

A system (1) for indexing and retrieving graphic and sound data ("entity 
data") employs a thesaurus (21) of descriptors (35) and equivalent words 

(e.g., synonyms, metonyms, foreign language equivalents, and 
misspellings) (23) . The system (1) can be used, for example, to index and 
retrieve images, sounds, and related data for social expression cards, 
video segments, multimedia presentations, photographs, audio data, and 
the like. The thesaurus (21) is built by allowing the user to specify a 
number of descriptors (35) and equivalent words ("meaning referent sets") 

(23) for each entity data, incorporating these descriptors (35) and 
meaning referent sets (23) into the thesaurus (21), and linking the 
entity data to the meaning referent sets (23) in the thesaurus (21) by 
using entity identifiers (30) . Entity data can then be retrieved by 
prompting the user to specify a list of descriptors (35), locating the 
descriptors (35) and any associated meaning referent sets (23) in the 
thesaurus (21), and retrieving entity data linked to these meaning 



referent sets (23) . 



French Abstract 

Un systeme (1) servant a indexer et a extraire des donnees graphiques et 
sonores ("donnees d'entite") utilise un thesaurus (21) de descripteurs 
(35) et de mots equivalents (par exemple, des synonymes, des metonymies, 
des equivalents en langue etrangere et des fautes d* orthographe) (23). Le 
systeme (1) peut s'utiliser, par exemple, afin d' indexer et d* extraire 
des images, des sons et des donnees relatives a des cartes de voeux, des 
segments video, des presentations sur supports multiples, des 
photographies, des donnees audio etc. . . Le thesaurus (21) est concu pour 
permettre a 1 ' utilisateur de specifier plusieurs descripteurs (35) et 
mots equivalents ("ensemble de reference de sens") (23) pour chaque 
donnee d'entite, pour incorporer lesdits descripteurs (35) et lesdits 
ensembles de reference de sens (23) et pour relier les donnees d'entite 
auxdits ensembles (23) au moyen de 1 ' utilisation d ' identif icateurs 
d'entite (30). On peut extraire ensuite des donnees d'entite en incitant 
1 'utilisateur a specifier une liste de descripteurs (35), en localisant 
les descripteurs (35) et tout ensemble de reference de sens associe (23) 
dans le thesaurus (21) et en extrayant les donnees d'entite reliees 
auxdits ensembles de reference de sens. 

Main International Patent Class: G06F-015/70 
Fulltext Availability: 

Detailed Description 
Detailed Description 
... thumbnails") to be 

retrieved and displayed by entering graphic data 

attributes in the form of a text description attached 

earlier to each graphic data item. 

Searching for graphic data items by using more 
than one attribute or keyword typically requires the 
use of the logical or Boolean operators AND, OR, and... 

... a method 

to coordinate the storage and retrieval of information 

by using controlled vocabulary terms to link 

functions. Controlled keyword vocabularies are widely 

used in information retrieval systems used to retrieve 

text documents , Various methods to improve the 

effectiveness of such systems have been evaluated 
including the use of some type of thesaurus so the 
terminology of an indexer can be reconciled with that 
of a searcher , The ideal thesaurus for use in such a 
system is considered by some to be one containing any 
term applied by any user to an... a 
postage stamp or a rubber stamp, 
A number of computer-based systems for storing 
and retrieving related linguistic expressions 
(including synonyms) from a large database have been 
invented in the past, including the following. 

Inventor Patent No. Issue Date 
Squillante et al. 51007,019 Apr. 9. 1991 
Kaji et al. . . 
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English Abstract 

A pattern match method is the primary component of any rule-based 
inference engine or database search method. Equivalence class projection 
is used in a discrimination match network, such that only equivalence 
class tokens (and not working memory objects) are propagated down the 
network, then only the first object which is a member of any specific 
equivalence class will cause an actual propagation down through the net. 
Subsequent changes which are either the creation of new objects which are 
members of a known equivalence class or the removal of any object but the 
last member of that equivalence class can totally avoid propagation 
downward in that section of the discrimination network. 

French Abstract 

Un precede de concordance avec un modele represente 1' element principal 
de tout precede de recherche de base de donnees ou de moteur d* inference 
fonde sur des regies. Dans un reseau de concordance de discrimination on 
utilise la projection de classe d ' equivalence, de maniere qu * uniquement 
des jetons de classe d' equivalence (et non pas les objets dans la memoire 
de travail) se propagent dans le reseau, ensuite seulement le premier 
objet faisant partie d'une classe d ' equivalence specifique peut provoquer 
la veritable propagation a travers le reseau. Des modifications 
ulterieures telles que la creation de nouveaux objets faisant partie 
d'une classe d* equivalence connue ou le retrait de n'importe quel objet a 
1' exception du dernier element de cette classe d* equivalence peuvent 
completement empecher la propagation descendante dans cette partie du 
reseau de discrimination. 

Main International Patent Class: G06F-009/44 
Fulltext Availability: 
Detailed Description 

Detailed Description 

... OF FAST PATTERN MATCH DETERMINATION BY EQUIVALENCE 
CLASS PROJECTION MEANS 
Background of the Invention 

This invention relates to pattern matching, such as 

arises in complex data base systems, and more specifically, in 

rule-based artificial intelligence systems (i,e., so-called 

"expert systems") In which the various conditions defined by 

the rules are structured as a network to allow efficient 

determination of which rules are applicable to data obtained 
by the system. 

An expert system is based on a computer program which 

comprises a set of rules and definitions of types of data that... 

...as "inference", the system performs matching between 

conditions set forth in the set of rules and data elements of 
the defined types in a working data base that has been 



obtained by the system from a user or from other data bases . 

When a set of data elements satisfies the conditions in one or 
more of the rules, one of more of the satisfied rules are 
"fired. . . 
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English Abstract 

Methods and apparatus are set forth for identifying the actual 
population of data within computer memory (103-1, 103-2, 103-4, 103-N) 
utilized to support a relational data base, where memory is defined 
herein as being ' 'populated' ' if it is both allocated and actually filled 
with data. By identifying data population (thereby identifying 
unpopulated areas of memory versus simply making a determination of 
memory allocation without regard to how it is being used) , utilities can 
be designed to recover memory resources, data base management techniques 
can be revised to more conservatively allocate memory, etc. Accordingly, 
memory resources can be more efficiently used and in certain instances 
the cost of adding memory to enhance a system can be delayed or 
eliminated, 

French Abstract 

L' invention decrit des precedes et un dispositif servant a identifier la 
population reelle de donnees a 1 ' interieur d'une memoire d'ordinateur 
(103-1, 103-2, 103-4, 103-N) utilisee pour supporter une base de donnees 
relationnelle, ou la memoire est definie comme etant ''peuplee'*, si elle 
est a la fois attribuee et remplie reellement par des donnees. Au moyen 
de 1 ' identification de la population de donnees (de ce fait, par 
identification de zones non peuplees de la memoire par rapport au simple 
etablissement de la determination de 1 ' attribution) , on peut concevoir 
des programmes utilitaires, afin de recuperer des ressources de memoire, 
on peut reviser des techniques de gestion de base de donnees, afin 
d'attribuer la memoire de facon plus productive, etc. De cette facon on 
peut utiliser plus efficacement les ressources de la memoire et, dans 
certains cas, on peut retarder ou eliminer les couts occasionnes par une 
memoire supplementaire servant a 1* extension d'unteme. 

Main International Patent Class: G06F-015/403 
Fulltext Availability: 
Claims 

Claim 

1* A method for performing relational data base memory 
utilization analysis, comprising the steps of: 



(a) selecting a set of data base files to be analyzed; 

(b) specifying a set of relations located within said 
set of data base files for which memory 
utilization information is to be determined; and 

(c) determining tuple usage for each relation in said 
set of relations. 

2e A. . . 

.does not have a data page 

structure that includes a status bit map. 
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4* A method for performing relational data base memory 
utilization analysis, comprising the steps of: 

(a) selecting a set of data base files to be analyzed; 

(b) specifying a set of relations located within said 
set of data base files for which memory 
utilization information is to be determined; 

(c) determining which of the relations in said set of 
relations has a data page ... comprising the 

steps of: 

(a) transforming tuple usage information, for each 
relation in said set of relations and for each 

file in said set of data base files, into a set of 
signals indicating memory population; and 

(b) outputting said set of signals indicating memory 
population to thereby provide an analysis of data 

base memory utilization. 
9* A method as set forth in claim 8 wherein said relational 

data base memory is included in a set of Office Dependent 
Data (ODD) files created by a telephone switching system. 
10a A method as set forth in. , . 

.tuple in each 
relation . 

lie A method for identifying the actual population of data 
within computer system memory utilized to support a set of 
relational data bases , comprising the steps of: 

(a) creating a list of files, wherein the set of files 
on said list constitute a preselected subset of 

said set of data bases and further wherein each 
file on said list contains a preselected set of 
relations to be analyzed; 

(b) creating a list of said preselected set... tuple on each 
data page in each such relation. 

SUBSTITUTE SHEET 

13* A method as set forth in claim 12 wherein said set of 
relational data bases is included in Office Dependent Data 
(ODD) files created by a telephone switching system that 
includes an administrative module, a communications module 
and a set . . . 

•file to provide an error 

indication whenever the page identifier for a given page is 
outside the range of the switch module within which the page 
is properly located . 

19* A method for performing memory population analysis on a 
relational data base , wherein the contents of said data base 
include a first set of relations each having an associated 
status bit map for indicating tuple usage within a given 
relation, and a second set... 

.that does not maintain 

an associated bit map, comprising the steps of: 

(a) inputting, to a digital computer having access to 

the contents of said data base , a first set of 

signals specifying the relations to be analyzed; 

oil 
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(b) inputting, to said digital computer, a Second set 

of signals ... tuples within each such relation by directly 

analyzing the contents of each tuple. 

"ET 

SU%= MOLM 

21* A method for analyzing a set of relational data bases 
that are stored in the memory of a set of switch modules 
included in a telephone switching system, wherein said set 
of relational data bases comprise static protected Office 
Dependent Data (ODD) that is classified as non-redundant if 
required for call processing based on special features and 
services required. . . 

...next ODD file on said 

list of ODD files as the ODD file to be analyzed 
if any such files remain and terminating the 
relational data base analysis if no further ODD 
files remain to be processed. 
SUBSTITUTE SIFqEeT 

22 A method as set forth in claim 21 wherein said memory 
population. . . 
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Claims 
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English Abstract 

This invention enables a user (5) to obtain information from a large 
number of commercial databases (20-24). In practicing the invention, the 
user (5) selects an area of interest (40) and enters a search request 
(42) . The search request (42) includes at least one word for which the 
user (5) desires to search. In one embodiment of the invention, the 
system selects a set of at least two databases (two of 20-24), 
automatically executes the search request (46) in each database, and 
presents the results to the user (5, 48). In another aspect of the 
invention, the user (5) selects a database directly, and employs a set of 
standardized commands (see figure 4) for any database selected. The 
system translates these standardized commands into the equivalent 
commands recognized by each database, without the intervention or 
knowledge of the user (5) . The user (5) can thus communicate with a 
variety of databases (20-24) using the same command set (see figure 4). 
In another embodiment, the invention guides the user (4) in reformulating 
a search which retrieved either no documents or too many documents. The 



invention also includes a method of determining which of the retrieved 
documents are likely to be the most relevant. 
French Abstract 

La presente invention permet a un utilisateur (5) d'obtenir des 
informations a partir d'un grand nombre de bases de donnees commerciales 
(20-24). En appliquant la presente invention, 1 * utilisateur (5) 
select ionne une zone d'interet (4 0) et introduit une demande de recherche 
(42) , La demande de recherche (42) contient au moins un mot que 
1 ' utilisateur (5) desire rechercher. Dans un premier mode de realisation 
de la presente invention, le systeme selectionne un groupe d'au moins 
deux bases de donnees (deux parmi les bases 20 a 24), execute 
automatiquement la demande de recherche (4 6) dans chaque base de donnees 
et presente les resultats a 1 * utilisateur (5, 48). Dans une variante de 
la presente invention, 1 ' utilisateur (5) selectionne une base de donnees 
directement et utilise un groupe d'ordres standardises (voir figure 4) 
pour n'importe quelle base de donnees selectionnee . Le systeme traduit 
ces ordres standardises en ordres equivalents reconnus par chaque base de 
donnees, sans intervention ou connaissance de la part de 1 ' utilisateur 
(5). L * utilisateur (5) peut ainsi communiquer avec une variete de bases 
de donnees (20-24), en utilisant le meme groupe d* ordres (voir figure 4). 
Dans un autre mode de realisation, 1' invention guide 1 * utilisateur a 
reformuler une recherche qui soit n'a extrait aucun document soit a 
extrait trop de documents. La presente invention se rapporte egalement a 
un procede permettant de determiner lesquels parmi les documents extraits 
sont susceptibles d'etre les plus pertinents. 

Main International Patent Class: G06F-015/403 
Fulltext Availability: 
Detailed Description 

Detailed Description 
. . . such as by 

imposing more stringent field or proximity restrictions on portions of 
the search request. 

The invention also includes a system and method for determining 
which retrieved documents are likely to be the most relevant. This 

method is performed without browsing through the actual documents. In 
stead, the system considers, for example, the fields in which the search 

terms were found. Thus, if a search term is found in the title of the 
document, it is likely that the document is more relevant than if the 
SUBSTITUTE SHEET 
term appeared. . . 
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Systems and methods for representing variable-length digit strings and for 

providing high-performance search operations 
Sytemen und Verfahren zxam Representieren von Ziffernfolgen variabler Lange 

und zum Bereitstellen eines Hochleistungssuchvorgangs 
Systemes et pro cedes pour repres enter des chaines de chiffres a longueur 

variable et pour permettre une operation de recherche de haute 

performance 
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ABSTRACT EP 1202164 A2 

Systems and methods providing high performance data manipulation and 
searching using a novel data structure for storing data values. The data 
structure includes a length value in combination with a digit-string data 
value. The combined length and data value can be stored in a data 
structure that may be sorted according to standard sorting techniques. 
When a set of data structures is sorted, the resulting order of the data 
values can be employed to increase performance of the data engine. In one 
embodiment, the length value is stored in the end portion of the data 
structure and in another embodiment, the length portion is stored in the 
beginning portion of the data structure. Range-based searches and 
ambiguity checking operations are also presented herein. 
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. . . G06F-017/30 

. . .SPECIFICATION INVENTION 

Embodiments of the present invention provide systems and methods for 
construction and execution of computing engines for processing and 
storing variable-length digit string data and matching presented 
values against designated ranges of values, while providing: (1) 
efficient storage of digit strings , (2) rapid editing of digit 
strings, and (3) rapid searching of tables indexed by such digit 
strings. Embodiments of the present invention provide systems and methods 



for efficient representation of digit strings, such as for... 
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ABSTRACT EP 1128278 Al 

A computer-implemented method of generating an input to be used by a 
classifying apparatus based on an electronic document comprising a 
plurality of elements, said method comprising: analyzing said electronic 
document to obtain one or more of said elements together with information 
about their corresponding position in said document; generating an 
electronic layout document to be used as said input of said classifying 
apparatus, said electronic layout document comprising: a representation 
of a plurality of said elements obtained in said analyzing step together 
with information representing their absolute and/or relative position in 
said electronic document. 
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...CLAIMS further comprises one or more of the following: 

searching for elements in said document which match a certain format 
criterion; 

searching for words in said document which match a certain string 

comparison criterion; 
performing a fault tolerant word search ; 
performing a search for an element which has a predefined relative 

position with respect to a found candidate; 
performing a database search for searching elements searcher searching 

for elements in said document which match a certain format criterion; 

a word searcher searching for words in said dociiment which match a 

certain string comparison criterion; 
a word searcher performing a fault tolerant word search ; 
an element searcher performing a search for an element which has a 

predefined relative position with respect to a found candidate; 
a database searcher performing... 

...CLAIMS said step of searching candidates further comprises one or more 
of the following: 

searching for elements in said document which match a certain format 
criterion; 

searching for words in said document which match a certain string 

comparison criterion; 
performing a fault tolerant word search ; 
performing a search for an element which has a predefined relative 

position with respect to a found candidate; 
performing a database search for searching elements ... the following: 
an element searcher searching for elements in said document which match 

a certain format criterion; 
a word searcher searching for words in said document which match a 

certain string comparison criterion; 
a word searcher performing a fault tolerant word search ; 
an element searcher performing a search for an element which has a 
predefined relative position with respect to a found candidate; 
a database searcher performing. . . 
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ABSTRACT EP 959420 A2 

A technique is provided for retrieving information from a plurality of 
documents (1) in a target language using a query (11) in a source 
language. The query (11) is converted into the target language using a 
multilingual resource (21-29) . A query in the target language is 
formulated (30) from the translated terms of the source language query 
and is applied (31) to an information management system (31) , When 
matches are found between the query and the contents of documents in the 
information management system (31), each document or a part thereof is 
converted (33) back into the source language using the same multilingual 
resource . 
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...SPECIFICATION searching by character strings for useful documents or 
files and selecting the most appropriate translation environment (such as 
a glosser or machine translation system) for located dociaments on the 
basis of the character string . Any translation which occurs is 
performed exclusively on located documents by the most appropriate 
translation environment for the subject matter as identified by the 
character string. 

WO 97/18516 is specifically concerned with translating Web... 

...SPECIFICATION such techniques. 

GB 2 320 773 relates to an automatic translation technique which is 
principally intended for use on the Internet. It is based on searching 
by character strings for useful documents or files and selecting the most 



appropriate translation environment (such as a multilingual resource or 
machine translation system) for located documents on the basis of the 
character string . Any translation which occurs is performed 
exclusively on located documents by the most appropriate translation 
environment for the subject matter as identified by the character string. 

WO 97/18516 is specifically concerned with translating Web. . . 
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systems 
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Provided is a method for reproducible sampling of data items of a 
dataset which is shared across a plurality of nodes of a parallel data 
processing system. 

In data mining of large databases, segmentation of the database is 
often necessary either to obtain a summary of the database or prior to an 
operation such as link analysis. A sample of data records are taken to 
create an initial segmentation model. The records of this sample and the 
initial model created from them can be critical to the results of the 
data mining process, and the initial model may not be reproducible unless 
the same sampling of data records is repeatable . Reproducible sampling is 
enabled without polling of all nodes to locate particular records, by 
providing each node with a small number of control parameters which 
describe the particular partitioning of the dataset and so enable 
computing of the location of a data record to allow for changes to the 
partitioning. The invention is applicable to other sampling methods. 
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...SPECIFICATION specific sequence and performing operations on the 

selected data, while ensuring adequate coverage of the data on each of 
the nodes, then a fast and efficient method is required for locating 
the required data items. 

One possible method of locating specific data items within a 
dataset which is shared across multiple nodes involves polling of all the 
individual nodes. A first node (generally a controller node) sends a 
query to all nodes to determine which has, say, item number 15 of the 
set of data items. One of the nodes should reply with a... 
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ABSTRACT EP 709788 Al 

Method and system for selectively retrieving information contained in 
a stored document set using a metric-based or "fuzzy" finite-state 
non-deterministic automaton. An automaton is constructed (501) 
corresponding to a text string query, text strings are read (502) from 
storage and corresponding dissimilarity values are generated (505) . 
Those strings resulting in values less than a given threshold are 
recorded (508) and listed for the user. Dissimilarity values are 
determined based on penalties associated with missing characters, extra 
characters, incorrect characters, and other differences between the text 
string query and a text string read from storage. (see image in 
original document) 
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...SPECIFICATION document text string in coordination with the application 
of other rules to create a text string that matches the pattern 
specified by the text string query , k is the number of rules, w( 
sub(n)) is the non-negative weight assigned to each rule , and the 
minimization is performed over all possible sequences of rule 
applications that could successfully convert the dociment text 
string to match the pattern of the text string query . 

The first set of rules stored in memory 106 is a set of manipulations 
that may be considered in the aggregate to modify the document... 
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The method of the present invention combines concept searching, 
document ranking, high speed and efficiency, browsing capabilities, 
"intelligent" hypertext, document routing, and summarization (machine 
abstracting) in an easy-to-use implementation. The method of the present 
invention also offers Boolean and statistical query options. The method 
of the present invention is based upon "concept indexing" (an index of 
"word senses" rather than just words.) It builds its concept index from a 
"semantic network" of word relationships with word definitions drawn from 
one or more standard human-language dictionaries. During query, users may 
select the meaning of a word from the dictionary during query 
construction, or may allow the method to disambiguate words based on 



semantic and statistical evidence of meaning. This results in a 
measurable improvement in precision and recall. Results of searching are 
retrieved and displayed in ranked order. The ranking process is more 
sophisticated than prior art systems providing ranking because it takes 
linguistics and concepts, as well as statistics into account, (see image 
in original document) 
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...SPECIFICATION is that speed and accuracy go hand in hand rather than 
compete during the retrieval. By integrating the ranking rules with the 
document indexes, the method of the present invention can efficiently 

search for the best matching document , even when the request 
contains dozens of search terms. The net effect is the nearly instant 
and accurate response to the user before all the searching is complete 

This substantially reduces the response time to the user and presents 
the best or approximately the best document first, not just any document 
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A compact character string retrieving system (1) capable of producing 
correctly the result of matching (205) without omission even upon 
occurrence of multiple matching (306) in which a plurality of search 
terms are matched for one character string (at 280) by a finite 
automaton. A destination state for transition brought about by a trailing 
character of the search term is newly created instead of an initial 
state. A transition table storage (220) stores the destination state. On 
the basis of the source state number and a specified pattern character 
code, the destination state number is read out from the state transition 
table storage (220) . When the state number read out represents the 
destination state of the transition brought about by the trailing 
character of the specified pattern character string, an identifier 
thereof is outputted. The identifiers of the search terms matched are 
each represented by one bit information, and a group of corresponding 
flags is stored in one slot. Multiple matching (280) can be performed 
without omission. The character string retrieving system is implemented 
in a reduced size, (see image in original document) 
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...CLAIMS said state number read-out means (250) represents the destination 
state number for the trailing character, said character string 
identifying means (261) assigns one bit information to the result 
of matching performed on said search character string , 
wherein a matching identifying flag for identifying said search 
character string is outputted in dependence on the bit position of 
said one bit information. 
4. A system according to claim 1, further comprising: 
multiple . . . 
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Detailed Description 

Claims 

Fulltext Word Count: 10220 
English Abstract 

A method and system for providing distributing a query to devices on a 
communications network, such as servers on the internet, using an 



application that can survey a subscriber's server that can include text 
documents and databases, and use the formatting information and data from 
the survey to create at least one dictionary customized to the 
subscriber's data sources. A user seeking information can initiate a 
search or query from an initiating device using keywords, natural 
language terms, connectors, expressions, etc., and the query can be 
transmitted to various subscriber customized dictionaries. The customized 
dictionaries can customize the query based on respective subscriber 
databases and text documents, text search engines, etc., to produce and 
customized query result. The query results can be filtered and integrated 
for presentation to the initiating device. Search results can be 
customized using user preference of profile information. 

French Abstract 

La presente invention concerne un precede et un systeme permettant de 
distribuer une demande a des dispositifs de reseau de communication, tels 
que des serveurs sur Internet, par 1 ' utilisation d'une application qui 
peut sonder un serveur d'abonne pouvant comprendre des documents texte et 
des bases de donnees. Ce systeme utilise ces donnees et informations de 
formatage en provenance du serveur pour creer au moins un dictionnaire 
individualise avec les sources de donnees de 1 ' abonne . Un utilisateur qui 
recherche des informations peut lancer une recherche ou une demande a 
partir d*un dispositif de lancement par mots cle, par termes de langage 
naturel, par connecteurs, par expressions etc., et cette demande peut 
etre transmise a divers dictionnaires individualises d'abonne. Ces 
dictionnaires individualises peuvent individualiser la demande fondee sur 
des bases de donnees et des documents texte d'abonne respectifs, des 
moteurs de recherche de texte, etc., de facon a produire et 
individualiser un resultat de demande. Les resultats de demande peuvent 
etre f litres et integres en vue d'une presentation au dispositif de 
lancement. On peut individualiser les resultats de recherche en utilisant 
une preference de 1 ' utilisateur ou une information de profil utilisateur. 

Legal Status (Type, Date, Text) 

Publication 20020207 A2 Without international search report and to be 

republished upon receipt of that report. 

Examination 20020404 Request for preliminary examination prior to end of 

19th month from priority date 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Detailed Description 

Detailed Description 

DISTRIBUTED SEARCH SYSTEM AND METHOD 
BACKGROUND 

(1) Field 

The systems and methods relate generally to search systems and methods 
, and more particularly to efficiently locating and accessing 
information across a communications network such as the internet. 

(2) Description of Related Art 

The internet provides a single conduit to reach a supplier of a... 
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Detailed Description 
Claims 

Fulltext Word Count: 17473 
English Abstract 

A concept-based indexing and search system (100) indexes collections of 
documents (180) with ontology-based predicate structures (120, 140) 
through automated and/or human-assisted methods. The system extracts the 
concepts behind user queries to return only those documents that match 
those concepts. The concept based search and retrieval system comprehends 
the intent behind a query from a user, and returns results matching that 
intent. The system can perform off-line searches for unanswered user 
queries and notify the user when a match is found. 

French Abstract 

Ce systeme d'indexation et de recherche (100), base concepts, repertorie 
des collections de documents (180) au moyen de structures de predicats 
basees ontologie (120, 140), et par le biais de precedes automatiques 
et/ou assistes par I'homme. Ce systeme extrait les concepts caches 
derriere les demandes utilisateur, de maniere a ne renvoyer que les 
documents qui correspondent a ces concepts. Ce systeme de recherche et 
d* extraction, base concepts, apprehende 1' intention cachee derriere une 
demande emanant d*un utilisateur et renvoie les resultats correspondant a 
cette intention. Ce systeme peut executer des recherches hors ligne pour 
des demandes utilisateur n'ayant pas encore recu de reponse et informe 
1 ' utilisateur lorsqu'une correspondance a ete trouvee . 
Legal Status (Type, Date, Text) 

Publication 20020207 Al With international search report. 

Examination 20021205 Request for preliminary examination prior to end of 

19th month from priority date 

Main International Patent Class: G06F-017/30 
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Fulltext Availability: 
Claims 



Claim 



1 A method of performing concept-based searching of text documents 
comprising the steps of transforming said text documents into predicate 
structures to form predicate 
libraries of said documents; 
inputting a natural language query ; 

creating a query predicate structure representing logical relationships 
between an argument; matching said query predicate structure to said 
document predicate structures in 
said predicate libraries; and 

presenting said matched predicate structures from said text documents 



2 A method of performing concept -based searching of text documents 
as recited in claim 1, wherein said predicate is one of a verb and a 
preposition. 

3 A method of performing concept... 
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English Abstract 

A user of an information network (Fig 1} is provided with information 
contained within at least one of plurality of network sites (24) in view 
of a user search criteria by searching a database (20) that is populated 
with a plurality of other search criteria provided by other network users 
and a plurality of search strategies. Each of the search strategies 



includes information regarding network sites likely to contain 
information relevant to the other search criteria. The other search 
criteria within the database are linked with the search strategies. At 
least one search strategy is selected from the database based upon the 
user's search criteria. The search strategy is executed and the results 
are provided to the user in one embodiment, users can employ software 
agents (22) to assist in searching, collecting and reporting on data in 
accordance with the user's specified criteria. 

French Abstract 

L' invention concerne un precede fournissant, a un utilisateur de reseau 
d ' informations , des informations contenues dans au moins une pluralite de 
sites de reseaux (24) en fonction d'un critere de recherche 
d'utilisateur . Le precede consiste a fournir une base de donnees (20) 
renfermant une pluralite d'autres criteres de recherche fournis par 
d'autres utilisateurs de reseaux, et une pluralite de strategies de 
recherche. Chaque strategie de recherche comprend des informations 
relatives aux sites de reseaux contenant eventuellement des informations 
correspondant aux autres criteres de recherche. Les autres criteres de 
recherche de la base de donnees sont lies aux strategies de recherche. On 
selectionne au moins une strategie de recherche dans la base de donnees 
renfermant les criteres de recherche d' utilisateurs . On execute ladite 
strategie de recherche et, selon un mode de realisation, les resultats 
sont fournis a 1 ' utilisateur . Les utilisateurs peuvent utiliser des 
agents logiciels (22) pour les assister dans leur recherche, collecter 
des donnees, et fournir un compte rendu de donnees en fonction des 
criteres specifies par 1 * utilisateur . 
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19th month from priority date 
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Detailed Description 

... a search conducted by a party, would remain unknown to the party 
unless the search engine search process is repeated. 

Hence, those concerned with obtaining information located within an 
information network have long felt the need for a method of 
efficiently and actively searching the network for information in 
response to a search criteria. The need has also been long felt for a 
method of searching a network without requiring the continued 
participation of the party on whose behalf the search is being 
conducted. The present invention clearly fulfills these needs and others, 

SUMMARY OF THE INVENTION 

Briefly, and in general terms, the present invention is... 



17/5, K/62 (Item 29 from file: 349) 

DIALOG (R) File 34 9:PCT FULLTEXT 

(c) 2003 WIPO/Univentio. All rts. reserv. 



00821289 

A SYSTEM AND METHOD FOR MATCHING REQUESTS FOR INFORMATION WITH SOURCES 
THEREOF 

SYSTEME ET PROCEDE PERMETTANT D'ASSOCIER DES DEMANDES D • INFORMAT IONS AUX 
SOURCES DE CES DERNIERES 

Patent Applicant/Assignee: 

NEURONIA LTD, Rival Street 7, 67778 Tel-Aviv, IL, IL (Residence), IL 
(Nationality), (For all designated states except: US) 
Patent Applicant/Inventor: 

GEVA Ori, Neuronia Ltd., Rival Street 7, 67778 Tel-Aviv, IL, IL 
(Residence), IL (Nationality), (Designated only for: US) 



ISH-HURWITZ Shai, Neuronia Ltd., Rival Street 1, 61118 Tel-Aviv, IL, IL 
(Residence), IL (Nationality), (Designated only for: US) 

HOCHBERG Assaf, Neuronia Ltd., Rival Street 7, 67778 Tel-Aviv, IL, IL 
(Residence), IL (Nationality), (Designated only for: US) 
Patent and Priority Information (Country, Number, Date) : 

Patent: WO 200153970 A2 20010726 (WO 0153970) 

Application: WO 2001IB268 20010124 (PCT/WO IB0100268) 

Priority Application: US 2000177355 20000124 
Designated States: AE AG AL AM AT AU AZ BA BE BG BR BY BZ OA CH CN CR CU CZ 

DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ 

LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG 

SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW 

(EP) AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR 

(OA) BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG 

(AP) GH GM KE LS MW MZ SD SL SZ TZ UG ZW 

(EA) AM AZ BY KG KZ MD RU TJ TM 
Main International Patent Class: G06F-017/00 
Publication Language: English 
Filing Language: English 
Fulltext Availability: 
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Claims 
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English Abstract 

A platform for mass information exchange by building a content rich 
virtual community that enables information to be exchanged by matching 
information seekers with information sources within a chat oriented 
environment. More specifically, a user searching for specific information 
is matched with a person who can provide the information or with an 
automated information channel. The user inputs a query into the system 
wherein the query is parsed and compared to a database of profiles 
defining a person or automated information channel as a source of 
information. When a match is made between a query and a source, the query 
is transmitted to the identified source. The source then provides a 
response to the query to the user. The system allows users to reach 
realms of personal knowledge repositories never before available by 
accessing the knowledge contained in another's mind, not merely the 
knowledge physically placed on the Internet. 

French Abstract 

L' invention concerne une plate-forme permettant un echange d ' informations 
en masse par la construction d'une communaute virtuelle riche en contenu 
qui permette 1' echange d ' informations par 1 ' association de chercheurs 
d' informations a des sources d' informations dans un environnement de 
forum de discussion. Plus specialement , un utilisateur recherchant une 
information specifique est associe a une personne pouvant fournir 
1 * information, ou a un canal d ' informations automatise. L ' utilisateur 
introduit une demande dans le systeme dans lequel elle est analysee et 
comparee a une base de donnees de profils definissant une personne ou un 
canal d ' informations automatise, tel qu*une source d ' informations . 
Lorsqu*une association est effectuee entre une demande et une source, la 
demande est transmise a la source identifiee. La source fournit ensuite 
une reponse a la demande de 1 * utilisateur . Le systeme permet a des 
utilisateurs d'acceder a des connaissances personnelles qui n'avaient, 
jusqu*alors, jamais ete disponibles, en accedant aux connaissances 
contenues dans 1' esprit d'une autre personne, et non plus simplement aux 
connaissances physiquement introduites sur Internet. 

Legal Status (Type, Date, Text) 

Publication 20010726 A2 Without international search report and to be 

republished upon receipt of that report. 

Main International Patent Class: G06F-017/00 
Fulltext Availability: 
Detailed Description 



Detailed Description 



matrix ensures that every 
possibility for matching a term to an individuals' profile thus matching 
an information seeker with an information source is exploited by 

searching the intersection of each row and column for a specified 
keyword. For example, the system of the present invention searches 
for a match between the keyword "Super Bowl" and the profiles 
contained in the database . Namely, the matching process will begin in 
the Al cell of the matrices wherein the actual query term filtered from 
the user's question is matched against the profile categories. If a 
match is made, the system will forward the user's question to the 
source of information associate with the matching profile . The 
matching process will continue performing searches among the 
intersecting cells of the matrices. For example, when the matching 
40 

process encounters cell B4 the expansion terms will be matched 
against the information contained in the profile descriptions. In this 
case, the matching process may be searching or attempting to match 
numerous terms as a result of the functional step of the expansion 
process. Thus, the matching process may be searching for a profile 
description that pertains to football, sports, NFL or football 
(American) . 

As results are returned, the results from each cell searching process 
are . . . 
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English Abstract 

This invention relates to a method of matching computer-based records 
(301) for identifying unique entities (303) both within and between 



disparate data files. This method of record-linkage has particular 
utility in the fields of epidemiology and health services research. 



French Abstract 

La presente invention concerne un procede permettant d'apparier des 
enregistrements inf ormatiques (301) afin d* identifier des entites uniques 
(303) tant a I'interieur de fichiers de donnees disparates qu'entre 
ceux-ci. Le procede d ' appariement d' enregistrements de 1' invention est 
particulierement utile dans le domains de 1 * epidemiologie et de la 
recherche sur les services de sante. 

Legal Status (Type, Date, Text) 

Publication 20010525 Al With international search report. 
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Detailed Description 

marriages, census results and others apparently linking some sources, 
to sustain his familial studies of deafness (Bruce, 1973; Bell, 1906) . 
For many applications involving multiple databases , enough 
inf ori-nation is present to allow an accurate human ' udgement about 
whether a record from one source refers to the same case as a record from 
other sources. However, this is an extremely time-consuming, error-prone, 
and unreliable method except for small data sets. Computer methods are 
necessary to perform this task for a record matching exercise to be 
cost effective. 

- I 

Summarv of the Invention 

The present invention is a computer-implemented system and method for 
creating a universal identifier. . . 
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English Abstract 

A computer implemented method for detecting and scoring similarities 
between documents in a source database (23) and a search criteria (15) . 
It uses a hierarchy of parent and child categories to be searched, 
linking each child category with its parent category. Source database 
(23) documents are converted into hierarchical database documents (24) 
having parent and child objects with data values organized using the 
hierarchy of parent and child categories to be searched. 

French Abstract 

Cette invention concerne une technique informatique de detection et de 
notation de similitudes entre des documents dans une base de donnees 
source (23) ainsi que des criteres de recherche (15) . Cette technique 
fait intervenir une hierarchie de categories mere et fille a etudier en 
reliant chaque categorie fille a sa categorie mere. Des documents de la 
base de donnees source (23) sont convertis en documents (24) de base de 
donnees hierarchique qui comportent des objets mere avec des valeurs de 
donnees agencees ,au moyen de la hierarchie de categories mere et fille a 
examiner . 



Legal Status (Type, Date, Text) 

Publication 20010329 Al With international search report. 
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19th month from priority date 
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Fulltext Availability: 
Claims 

Claim 

... user interface. 

4 8 The method of claim 4 4 wherein the scoring method is algorithmic. 

4 9 The method of claim 4 4 wherein the scoring method generates search 
criteria values and compares the data values to the search criteria 
values and if a match occurs, a number is saved in a score buffer for the 

data value that 
represents a match . 

50 The method of claim 20 further comprising performing cross 
database 

searching using the same schema and query , repeating claim 20, steps 

a 

through d for each of N number of source databases and allowing a user 
to 

view a result for each database . 



51 The method of claim 50 further comprising displaying the search 
criteria and the results for the N source databases on a user's computer 
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English Abstract 

A method of performing a sliding window search includes the steps of : 
creating an associative database of a plurality of data strings (42), 
receiving a first window of a data block (44), iconizing the first window 
of the data block to form a first icon (46), determining if the first 
icon has a match in the associative database (48), determining a first 
byte icon of a first byte of data in the first window (50), executing an 
icon shift function to form a shifted first byte icon, (52) exclusive 
ORing the shifted first byte icon with the first icon to form a seed icon 

(54), determining a second icon for a second window using the seed icon 
and transforming a new byte of data onto the seed icon (56), and 
determining if the second icon has a match in the associative database 

(58) . 

French Abstract 

La presente invention concerne un procede de recherche a fenetre 
glissante qui consiste (a) a creer une base de donnees associative d*une 
pluralite de chaines de donnees, (b) a recevoir une premiere fenetre sur 
un bloc de donnees, (c) a iconiser la premiere fenetre du bloc de donnees 
afin de former une premiere icone, (d) a determiner si la premiere icone 
possede une correspondance dans la base de donnees associative, (e) a 
determiner une premiere icone de bit d*un premier bit de donnees dans la 
premiere fenetre, (f) a executer une fonction de decalage d' icone afin de 
former un premier icone de bit decale, (g) a effectuer un OU exclusif 
entre 1' icone de premier bit decale et la premiere icone afin de former 
une icone semence, (h) a determiner une seconde icone pour une seconde 
fenetrer a I'aide de 1* icone semence et a transformer un nouveau bit de 
donnees sur 1' icone semence, et (i) a determiner si la seconde icone 
possede une correspondance dans la base de donnees associative. 

Main International Patent Class: G06F-007/02 
Fulltext Availability: 
Detailed Description 

Detailed Description 

... as searching for a single three byte string of data. 

This is because each new data string just represents a different 
entry in the associative database 26. Whereas standard compare 
functions would have to perform a comparison for each data string 
being searched . Thus this invention is particularly helpful where 



numerous data strings need to be matched 



Often the data strings for which we are searching have 
differing lengths. In one embodiment this is handled by defining a 
separate window search size (e.g., W2-, 24). The two or more 
window sizes operate completely independently as described 
above. In another embodiment, the associative database 26 
contains a qualified match for a first portion of each the data 
strings that are longer than the window length. Note in this case 
the window length (window size) is selected to be equal to the 
shortest data string being searched . When the process encounters 
a qualified match, two alternative implementations are possible. 

In one implementation, there is a pointer 34 associated with the 
qualified match. . . 
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METHOD AND APPARATUS FOR ACCESSING ON-LINE STORES 
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Fulltext Word Count: 14116 
English Abstract 

This invention provides a computer-implemented agent (5) that assists a 
user (1) in accessing network linked (4 and 6) on-line stores (7) . In one 
aspect, the invention is a method for intelligently routing a user query 
to on-line stores (7) relevant to that query, extracting relevant data 
fields from received responses, and intelligently presenting the 
extracted data in order of estimated interest. In another aspect, the 
system of this invention implements one or more steps of the method in a 
centralized or distributed manner on one or more network linked computers 
(4 and 6) . Further, this invention provides a novel heuristically guided 
process by which the agent is capable of automatically acquiring 
sufficient information on the characteristics of on-line stores (7) for 
it to access and shop at those stores. 

French Abstract 

1' invention concerne un agent informatise qui aide un utilisateur a 
acceder a des boutiques en direct reliees a un reseau. Un aspect de cette 
invention concerne un precede qui permet d'acheminer intelligemment une 
demande d' utilisateur jusqu'a des boutiques en direct propre a la 
demande; d'extraire les domaines de donnees pertinents a partir des 
reponses recues et de presenter de maniere intelligente les donnees 
extraites par ordre d'interet estime. Un autre aspect de cette invention 
concerne le systeme de 1' invention qui met en oeuvre une ou plusieurs 
etapes du precede de maniere centralisee ou decentralisee sur un ou 
plusieurs ordinateurs relies a un reseau. Cette invention concerne 
egalement un nouveau processus guide de maniere heuristique qui permet a 
1' agent d'acquerir automatiquement des informations suffisantes 



concernant les caracteristiques des boutiques reseautees en direct pour 
qu'il puisse acceder a ces memes boutiques et y faire ses achats. 



Main International Patent Class: G06F-017/30 
Fulltext Availability: 

Detailed Description 

Claims 

Detailed Description 

... typically takes 515 minutes-per vendor. 

5o4o SHOPPING AT ON-LINE STORES 
TABLE 2 - A VENDOR DESCRIPTION 

The URL of a page containing a product search 
form, and optionally an encoding of the preferred 

search form on this page 
Pairs of strings, each pair being an attribute 
name and a field name, for mapping of product 

query attributes to fields of that form. 
Strings used by parsing functions for extracting 
product data from pages returned from the on-line 
store by matching portions of the returned pages, 
including 

A string that matches a unique portion of a 

search failure page {e,g,, "Product not 
found") , 

Header and trailer strings that match mere 
header and trailer formatting information 
used in search success pages . 

An abstract format string that matches the 
components of product descriptions in search 
success pages and is used to extract 
information from those descriptions. 

The output of the first learning phase is an on-line 
store description, which. . . 

Claim 

. . . the other string of each said pair 

being the name of a fill-in field of said product query form; 
C. a second string for matching product search failure 
20 pages from said on-line store; 

d, a third string for matching header information on 
successful product search pages from said on-line store; 

e, a fourth string for matching trailer information of 
said successful product search pages; and 

f, a fifth string for matching and extracting product 
information from said successful product search pages, said 

fifth string representing the abstract format of said product 
information, 

30 15, The method of claim 14 further comprising prior to said 
step of fetching the additional... 

. . .of : 

a. fetching a plurality of pages from an on-line store 
without a current on-line store description; 

b, selecting a plurality of candidate query forms from 
35 said plurality of pages; 

C, for each candidate query form determining a mapping 
of product attributes to fill-in fields of each candidat@e 
46 

query form, querying said on-line store with said query form 
filled-in with dummy products and with popular products to 
further determine said first, second, third, fourth, and 
fifth strings and said plurality of pairs of strings, and 
5 estimating ranking of the query success of this query form; 
and 



d, selecting the candidate query form and associated 
strings and pairs of strings with the highest ranking as the 
on-line store description; and 

16 The method of claim 15... 
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Claims 

Fulltext Word Count: 6072 
English Abstract 

Pattern-matching circuitry which permits the simultaneous or near 
simultaneous matching of entire lines of text to a regular expression 
test pattern. The circuitry comprises: a plurality of symbol recognition 
cells (21-23) means for applying each one of said symbol signals (li - 
In) to a respective one of said cells, means (24) for sequentially 
applying each of said pattern test signals to all of said cells, and, 
means (26) for detecting the sequential appearance of said pattern test 
signals in successive ones of said cells. In one embodiment the symbol 
recognition cells are variants of asynchronous finite state machines. 

French Abstract 

Un circuit de determination de concordances de modeles permet la 
determination simultanee ou quasi-simultanee de concordances de lignes 
entieres de texte avec un modele de test d* expressions courantes. Le 
circuit comprend une pluralite de cellules de reconnaissance de symboles 
(21, 23), des moyens d' application de chaque signal de symboles (li-In) a 
une cellule respective de reconnaissance, des moyens (24) pour appliquer 
sequentiellement chacun de ces signaux de test de modele a toutes ces 
cellules et des moyens (26) de detection de 1' apparition sequentielle de 
ces signaux de test de modele dans des cellules successives. Dans un mode 
de realisation, les cellules de reconnaissance de symboles sont des 
variantes de machines asynchrones d'etat fini. 

Main International Patent Class: G06F-015/40 

Fulltext Availability: 
Detailed Description 

Detailed Description 

. . . time, there has been a steady shift from such primarily 
arithmetic applications to a heavy dependence on symbolic 
computationt typified by document preparation and text 

searching , often as part of so-, called "office automation" 
systems. However, the basic architecture of computers has 
not been dramatically affected by this change in 
applications . . . 



.remain essentially serial machines 



with elaborate arithmetic capabilities and therefore do not 
provide efficient non0numeric computation. 
Fundamental process used in office automation 

systems is searching through lines of text for given words, 
phrases, or sentences. This pattern-matching is 
particularly vital to locating pertinent information in 
large masses of textual data available in machine-readable 
form# i.e., in large data bases . Indeed,, the well being 
the information age is heavily dependent on fast and 

efficient methods of locating pertinent information i 
large collections of such information. Again, however, the 
traditional design of general purpose digital computers, as 
essentially arithmetic, serial machines, has not lent 
itself to fast and efficient pattern- searching . indeed, 
because of the large volume of data which must be processed 
in a serial manner, many data retrieval problems become 
uneconomical; extensive indexing schemes are resorted to in 

TlOlt; 

order to limit the field of search . 
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Set Items Description 

51 906018 (PERFORM? OR PROFICIEN? OR EFFECTIV? OR EFFICIEN? OR SUCCE- 

SS?) (5N) {RULE? ? OR TEMPLATE? ? OR STRATEG? OR FILTER? ? OR P- 
LAN OR PLANS OR POLICY OR POLICIES OR PROFILE? ? OR METHOD? OR 
EXPRESSION? ? OR STATEMENT? ? OR PHRASE? ? OR STRING? ?) 

52 46149 (PERFORM? OR PROFICIEN? OR EFFECTIV? OR EFFICIEN? OR SUCCE- 

SS?) (5N) (SEARCH OR QUERY OR QUERIES) 

53 584268 (MATCH??? OR LOCATE? ? OR LOCATING OR FIND??? OR RETRIEV? - 

OR OBTAIN?) (5N) (RECORD? ? OR DOCUMENT? ? OR FILE? ? OR PAGE? ? 
OR WEBPAGE? ? OR SITE? ? OR WEBSITE? ? OR HIT? ? OR URL? ? OR 
RESOURCE 0 LOCATOR? ? OR OBJECT? ? OR DATA) 

54 45104 9 (MATCH??? OR LOCATE? ? OR LOCATING OR FIND??? OR RETRIEV? - 

OR OBTAIN?) (5N) (IMAGE? ? OR PICTURE? ? OR PHOTO? ? OR PHOTOGR- 
APH? ? OR CLIP? ? OR INFORMATION OR ARTICLE? ?} 

55 13364 61 DATABASE? ? OR DATA () BASE? ? OR REPOSITOR??? OR SEARCH??? - 

OR QUERY??? OR QUERIE? ? 

56 2506 SI (5N)S3:S4 (5N) (MEASUR? OR ASSESS? OR EVALUAT? OR ANALYZ? - 

OR ANALYS? OR CHECK??? OR GAUG??? OR DETERMIN? OR STATISTIC? - 
OR ESTIMAT??? OR TEST??? OR CALCULAT? OR ASCERTAIN? OR FIND??? 
OR COMPUTE OR COMPUTES OR COMPUTED OR COMPUTING) 

57 623 S5 AND S6 

58 885 S2 (5N)S3:S4 (5N) (MEASUR? OR ASSESS? OR EVALUAT? OR ANALYZ? - 

OR ANALYS? OR CHECK??? OR GAUG??? OR DETERMIN? OR STATISTIC? - 
OR ESTIMAT??? OR TEST??? OR CALCULAT? OR ASCERTAIN? OR FIND??? 
OR COMPUTE OR COMPUTES OR COMPUTED OR COMPUTING) 

59 283 S8 AND (DATABASE? ? OR DATA()BASE? ? OR REPOSITOR???) 

510 345 S7 AND (DATABASE? ? OR DATA()BASE? ? OR REPOSITOR???) 

511 159 SIO AND (SEARCH??? OR QUERY??? OR QUERIE? ?) 

512 124264 (MATCH??? OR LOCATE? ? OR LOCATING OR FIND???) (5N) (RECORD? 

? OR DOCUMENT? ? OR FILE? ? OR PAGE? ? OR WEBPAGE? ? OR SITE? 
? OR WEBSITE? ? OR HIT? ? OR URL? ? OR RESOURCE () LOCATOR? ? OR 



OBJECT? ? OR DATA) 

' S13 69013 (MATCH??? OR LOCATE? ? OR LOCATING OR FIND???) (5N) (IMAGE? ? 

OR PICTURE? ? OR PHOTO? ? OR PHOTOGRAPH? ? OR CLIP? ? OR INF- 
ORMATION OR ARTICLE? ?) 
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05484018 E.I. No: EIP00025039041 

Title: Object based image retrieval through learning from user search 
patterns and profiles 

Author: Xu, Yaowu; Saber, Eli; Murat Tekalp, A. 
Corporate Source: Univ of Rochester, Rochester, NY, USA 

Conference Title: Proceedings of the 2000 'Storage and Retrieval for 
Media Databases 2000' 

Conference Location: San Jose, CA, USA Conference Date: 
19000126-19000128 

Sponsor: IS and T; SPIE 

E.I. Conference No.: 56354 

Source: Proceedings of SPIE - The International Society for Optical 
Engineering v 3972 2000. p 234-239 
Publication Year: 2000 
CODEN: PSISDG ISSN: 0277-786X 
Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
Journal Announcement: 0004W1 

Abstract: In this paper, we propose a self-learning content-based image 
indexing and retrieval system. Our system employs a hierarchical content 
representation (consisting of objects and regions) and a hierarchical 
content matching method for effective and efficient image /object 
retrieval. The 'learning' behavior is enabled by our proposed hierarchical 
content representation which allows easy storage of combinations of regions 
that have resulted in successful matches to objects of interest as 
determined by user search patterns and profiles . The learning step 
effectively performs an automatic analysis of database images into 
meaningful objects given certain user search patterns and interest 
profiles. The advantages of the proposed hierarchical content 
representation and 'learning' schemes are demonstrated on a collection of 
car and face images, where the significant improvements in search and 
retrieval speed are described both theoretically and experimentally. 
(Author abstract) 13 Refs. 

Descriptors: Information retrieval systems; Indexing (of information); 
Database systems; Learning systems 

Identifiers: Self-learning content-based image indexing and retrieval 
systems; Hierarchical content description 

Classification Codes: 

903.3 (Information Retrieval & Use); 723.5 (Computer Applications); 

903.1 (Information Sources & Analysis); 723.3 (Database Systems); 723.4 

(Artificial Intelligence) 

903 (Information Science); 723 (Computer Software) 

90 (GENERAL ENGINEERING) ; 72 (COMPUTERS & DATA PROCESSING) 
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Title: Location hashing: an efficient indexing method for locating 
object queries in image databases 
Author: Syeda-Mahmood, Tanveer 

Corporate Source: IBM Almaden Research Cent, San Jose, CA, USA 

Conference Title: Proceedings of the 1999 7th Conference of the Storage 
and Retrieval for Image and Video Databases VII 

Conference Location: San Jose, Ca, USA Conference Date: 
19990126-19990129 

Sponsor: Is and T; SPIE 

E.I. Conference No.: 55094 

Source: Proceedings of SPIE - The International Society for Optical 
Engineering v 3656 1999. p 366-378 
Publication Year: 1999 
CODEN: PSISDG ISSN: 0277-786X 



Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
Journal Announcement: 9908W1 

Abstract: Queries referring to content embedded within images are an 
essential component of content -based search , browse, or summarize 
operations in image databases . Localization of such queries under 
changes in appearance, occlusions and background clutter, is a difficult 
problem, for which current spatial access structures in databases are not 
suitable. In this paper we present a new method of indexing image 
databases called location hashing that uses a special data structure 
called the location hash tree (LHT) for organizing feature information from 
images of a database . Location hashing is based- on the principle of 
geometric hashing and determines simultaneously, the relevant images in the 
database and the regions within them that are most likely to contain a 2d 
pattern query without incurring detailed search of either. The location 
hash tree being a red-black tree, allows for efficient search for 
candidate locations using pose-invariant feature information derived from 
the query . (Author abstract) 21 Refs. 

Descriptors: Database systems; Image processing; Indexing (of 
information) ; Data structures; Computational geometry; Two dimensional; 
Online searching 

Identifiers: Location hash tree; Location hashing; Image databases 

Classification Codes: 

723.3 (Database Systems); 723.2 (Data Processing); 903.1 (Information 
Sources & Analysis); 723.5 (Computer Applications) 
723 (Computer Software); 903 (Information Science) 
72 (COMPUTERS & DATA PROCESSING); 90 (GENERAL ENGINEERING) 
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04423883 E.I. No: EIP96063215282 

Title: Design of large intelligent image database systems 
Author: Huang, P.W.; Jean, Y.R. 

Corporate Source: Natl Chung-Hsing Univ, Taichung, Taiwan 

Source: International Journal of Intelligent Systems v 11 n 6 Jun 1996. p 
347-365 

Publication Year: 1996 

CODEN: IJISED ISSN: 0884-8173 

Language: English 

Document Type: JA; (Journal Article) Treatment: A; (Applications); X; 
( Experimental ) 

Journal Announcement: 9608W2 

Abstract: Image knowledge structure and an access method for retrieving 
images are two of the most important problems in the design of large 
intelligent image data base systems. In this article, we present a 
software architecture which utilizes an image knowledge structure, called 
2D C** plus -string, as the theoretical basis for intelligent image data 
base systems. Our architecture also provides an efficient access method 

to quickly locate the desired images by pruning a large percentage of 
nonpromising iconic indices represented by 2D C** plus -strings. The 2D C** 
plus -string spatial knowledge representation makes the image data base 
system become more intelligent, while the access method supports effective 
image retrieval from a large image data base without degrading the 
system's overall performance. Experimental results show that our access 
method is flexible enough to adapt to the change of image data base and 
very efficient in reducing the number of searches in similarity 
retrieval. (Author abstract) 11 Refs. 

Descriptors: Database systems; Computer aided software engineering; 
Knowledge representation; Image processing; Information retrieval; Object 
oriented programming; Indexing (of information) 

Identifiers: Large intelligent image database systems; Image knowledge 
structure; Image retrieval; Iconic indices representation 

Classification Codes: 

723.3 (Database Systems); 723.5 (Computer Applications); 721.1 
(Computer Theory, Includes Formal Logic, Automata Theory, Switching Theory, 



Prograinining Theory); 723.2 (Data Processing); 903.3 (Information 
Retrieval & Use); 723.1 (Computer Programming) 

723 (Computer Software); 721 (Computer Circuits & Logic Elements); 903 
(Information Science) 

72 (COMPUTERS & DATA PROCESSING); 90 (GENERAL ENGINEERING) 
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04366698 E.I. No: EIP95122 93758 9 

Title: Object recognition by multiresolutional template matching 

Author: Prasad, Lakshman; Rao, Ramana L. 

Corporate Source: Los Alamos Natl. Lab., Los Alamos, NM, USA 
Conference Title: Digital Image Storage and Archiving Systems 
Conference Location: Philadelphia, PA, USA Conference Date: 
19951025-19951026 

Sponsor: SPIE - Int Soc for Opt Engineering, Bellingham, WA USA 
E.I. Conference No.: 22420 

Source: Proceedings of SPIE - The International Society for Optical 
Engineering v 2606 1995. Society of Photo-Optical Instrumentation 
Engineers, Bellingham, WA, USA. p 300-311 

Publication Year: 1995 

CODEN: PSISDG ISSN: 0277-786X ISBN: 0-8194-1970-2 
Language: English 

Document Type: CA; (Conference Article) Treatment: T; (Theoretical); A; 
(Applications) 

Journal Announcement: 9605W3 

Abstract: A novel and efficient pose-invariant guided template 
matching algorithm, for object recognition in images , is proposed. 
Template matching is performed in a rotation-scale-translation (pose) 
invariant fashion, thus greatly reducing the 4-d search space to a single 
point. The invariance is achieved by preprocessing the input image and 
associating certain geometric descriptors with the objects in the image. 
These descriptors completely characterize the affine parameters associated 
with the objects which must be applied to candidate templates, and the 
location in the image where the template is to be applied for a match. 
Preprocessing and matching are performed on a wavelet pyramidal 
decomposition of the image in a multiresolutional coarse-to-fine fashion 
for computational efficiency. An efficient search strategy is also 
proposed for selecting templates in the template database . 22 Ref s . 

Descriptors: Pattern recognition; Database systems; Efficiency; Image 
processing; Algorithms 

Identifiers: Template matching; Rotation-scale-translation invariance; 
Affine parameters; Wavelet pyramidal decomposition 

Classification Codes: 

723.4 (Artificial Intelligence); 741.1 (Light /Optics ) ; 723.3 (Database 
Systems); 723.1 (Computer Programming) 

723 (Computer Software); 741 (Optics & Optical Devices) 
72 (COMPUTERS & DATA PROCESSING) ; 74 (OPTICAL TECHNOLOGY) 
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Title: Design of a DFSA based pattern matcher. 

Author: Fan, Jang- Jong; Su, Keh-Yih 

Corporate Source: Natl Tsing Hua Univ, Hsinchu, Taiwan 

Source: Chung-kuo Kung Ch'eng Hsueh an/ Journal of the Chinese Institute 
of Engineers v 14 n 3 May 1991 p 325-331 
Publication Year: 1991 
CODEN: CKCKDZ ISSN: 0253-3839 
Language: English 

Document Type: JA; (Journal Article) Treatment: G; (General Review) 
Journal Announcement: 9111 



Abstract: In order to eliminate the irrelevant data being transferred 
from disks into memory, many database machines adopt a data filter to 
filter out the irrelevant data on-the-fly. This paper proposes a new DFSA 
(Deterministic Finite State Automata) based pattern matcher to be 
implemented as the data filter . It can perform various searching 
operations, including fixed and variable length don't care patterns. The 
architecture of this pattern matcher is relatively simple and can be easily 
implemented by off-the-shelf LSI chips. In our design, it only requires one 
16K multiplied by 8 SRAM and five LSI chips for a given state transition 
table of 128 states. A test has been conducted to compare the performance 
of dBASE III and our data filter. The result shows that the data filter is, 
on average, five times better than the performance of dBASE III. (Author 
abstract) 10 Refs. 

Descriptors: DATABASE SYSTEMS; DATA PROCESSING — Data Acquisition; DATA 
TRANSMISSION; INTEGRATED CIRCUITS, LSI 

Identifiers: DATABASE MACHINE DATA FILTER; DATABASE TEXT RETRIEVAL 
SYSTEM; DETERMINISTIC FINITE STATE AUTOMATA (DFSA) 

Classification Codes: 

723 (Computer Software); 713 (Electronic Circuits) 

72 (COMPUTERS & DATA PROCESSING); 71 (ELECTRONICS & COMMUNICATIONS) 
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01673375 ORDER NO: AAD99-08971 

INDEXING AND MINING MULTIMEDIA DATABASES 

Author: KORN, PHILIP RUSSELL (FLIP) 

Degree: PH.D. 

Year: 1998 

Corporate Source/Institution: UNIVERSITY OF MARYLAND COLLEGE PARK (0117) 
Chairperson: CHRISTOS FALOUTSOS 

Source: VOLUME 59/10-B OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 54 43. 142 PAGES 
Descriptors: COMPUTER SCIENCE ; INFORMATION SCIENCE 
Descriptor Codes: 0984; 0723 

In this thesis we examine methods for two related problems: (a) 
indexing multimedia databases for fast searching , and (b) mining data 
from a multimedia database to extract patterns, correlations, rules and 
outliers . 

We first investigate the problem of retrieving similar shapes from a 
large database ; in particular, we focus on medical tumor shapes ( * Find 
tumors that are similar to a given pattern.*) . We use a natural similarity 
function for shape matching based on concepts from mathematical morphology 
and show how it can be lower-bounded by a set of shape features for safely 
pruning candidates, thus giving fast and correct output. These features can 
be organized in a spatial access method, leading to fast indexing for range 

queries and nearest neighbor queries . In addition to the 
lower-bounding, our second contribution is the design of a fast algorithm 
for nearest neighbor search , achieving significant speedup while provably 
guaranteeing correctness. Our experiments demonstrate that roughly 90% of 
the candidates can be pruned using these techniques, resulting in up to 27 
times better performance compared to sequential scan. 

The second problem is data mining. We propose methods for 
efficiently finding * interesting things' from large data sets in the 
form of summaries, outliers, and rules, based on powerful tools from 
statistics. Our first goal was to compress a very large data set in a 
manner that efficiently supports ad hoc queries , as is often desired for 
data mining. Compressed data is notoriously difficult to access randomly. 
We show how this can be done in three passes, provided that a small amount 
of lossiness can be tolerated. The proposed method, called SVDD, achieves 
an astounding 0.5% reconstruction error with a space requirement under 2% 
(i.e., a 50:1 compression ratio). 

A good data compression method naturally leads to data mining. All 
compression methods operate by detecting patterns, or rules, which are 
exploited to reduce redundancy. In the last part of this thesis, we 



consider the problem of discovering, interpreting, and using such rules, 
given data organized in a matrix of, e.g., customers x products. We propose 
a new paradigm, namely. Ratio Rules, which are quantifiable in that one can 
measure the ability of the rules to recover values that are pretended to be 
unknown; this is in contrast to the well known association rules. Another 
contribution is a novel method to guess missing -values from the rules. 
(Abstract shortened by UMI . ) 



19/5/16 (Item 4 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 

(c) 2003 ProQuest Inf o&Learning . All rts. reserv. 

920997 ORDER NO: AAD86-13550 
QUERY OPTIMIZATION IN RELATIONAL DATABASE SYSTEMS 
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Degree: PH.D. 
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PAGE 1627. 162 PAGES 
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This dissertation examines the optimization problem of query- 
processing in relational database systems. To provide the user with 
non-procedural queries , the database management system takes the 
responsibility of finding a method to access data . The efficiency 
of the query processing method used dominates the performance of the 
database system. The usual approach to selecting a processing method is to 
use heuristics. If often fails to capture the most efficient method. 
Therefore a better optimization scheme is the main objective of this work. 

A formal model for the methods of processing one-variable queries 
in relational calculus is developed. In this model, a query and the 
database organization are represented by a hyper-graph. Any processing 
method is a sub-graph of this hyper-graph with a cost. The optimization 
problem is to find the subgraph with the minimum cost. Using this model, 
all sub-graphs can be enumerated, classified, and characterized. Based on 
the properties of the sub-graphs, we can show that some of them can never 
be optimal. With the aid of the properties, exact optimization algorithms 
are given, which avoid most non-optimal sub-graphs. Experiments show that 
the algorithms reduce the search space considerably. Several heuristic 
and simpler algorithms are also presented to speed up the processing time. 

In this dissertation, a set of processing methods larger than 
those in previous work is explored. Different usages of compound indices 
are considered. The target retrieval in special cases is investigated. It 
is shown that finding the optimal processing method is NP-complete even for 
one-variable queries . 
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How are you going to find it: what librarians don't know, think they know, 
want to know, and should know about Web search engines. 
Author(s): Brunning, Dennis 
Editor(s): Williams, Martha E 

Corporate Source: Arizona State University, Tempe, AZ 

vol. 21, pages 55-60 
Publication Date: 2000 
ISSN: 1573-7059 

Conference Title: Proceedings of the National Online Meeting 
Conference Location: New York, NY 
Conference Date: May 16-18, 2000 



Publisher: Information Today, Inc. 
Language: English 

Place of Publication: Medford, NJ 
Document Type: Journal Article 
Record Type: Abstract 
Journal Announcement: 3503 

Current estimates put the indexable Web at 320 million hypertext 
formatted pages. Of these pages, available search engines index only 
one-third. Within this parameter are wide variations in indexing, scope, 
coverage, method , and performance , which create huge problems of 
finding pages relevant to queries typically handled by reference 
librarians. Focuses on the dilemmas, solutions, strategies, and questions 
that librarians face in using Web search engines. Uses a mix of inquiry 
tools, including focus groups, participant observation, and surveys to 
assist in studying the following factors: (1) which search engines 
reference librarians rely upon in daily practice and why; (2) who they keep 
up-to-date on search engine technology and interfaces; (3) how well they 
understand the scope of search engine coverage; (4) how they go about 
confirming their understanding of the scope, coverage, searching 
interface, and performance of search engines; and (5) the design, 
database scope and coverage, and performance features that they wish to be 
incorporated into search engines. Results can be used to advance search 
engine design, performance, and standards, and also underscore and 
elaborate how reference librarians cope with this ever-evolving powerful 
technology. 

Descriptors: Searching ; Reference librarians; Internet; User studies 
Classification Codes and Description: 2.04 (User and Usage Studies); 5.11 ( 

Searching and Retrieval) 
Main Heading: Research Methods; Information Processing and Control 
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Toward more comprehensive Web searching . Single searching versus 
megasearching . 

Author (s): Notess, G 

Corporate Source: Montana State Univ, Bozeman, MT 

Online vol. 22, no. 2, pages 73-76 

Publication Date: Mar-Apr 1998 

ISSN: 0146-5422 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 3300 

The author's earlier exploration of the major Web indexes demonstrates 
that these search engines have much less overlap than expected, and that 
they contain records unique to each of them. Given the number of unique 
items in each database and the lack of duplication, efficient 
strategies must be found for finding the needed information via these 
search engines. Using a multiple engine search tool, such as Inference 
Find, Dogpile, and MetaFind, is one appropriate approach. A second method 
is to search the largest of the databases one by one, using the command 
language unique to each to increase the precision of the search . Both the 
megasearch and the single tool methods are useful, but neither fully 
addresses the problem of trying to run a comprehensive search of Web 
sites. The advantages and limitations of each approach are examined. 

Descriptors: Databases ; Internet; Searching 

Classification Codes and Description: 5.11 ( Searching and Retrieval); 

6.01 (Networks, Regional Systems, Consortia 
Main Heading; Information Processing and Control; Information Systems and 

Applications 
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6919311 INSPEC Abstract Number: C2001-06-6160S-027 
Title: Efficient region query processing by optimal page ordering 

Author (s): Dae-Soo Cho; Bong-Hee Hong 

Author Affiliation: Dept. of Comput . Eng., Pusan Nat. Univ., South Korea 

Conference Title: Current Issues in Databases and Information Systems. 
East-European Conference on Advances in Databases and Information Systems 
Held Jointly with International Conference on Database Systems for Advanced 
Applications, ADBIS-DASFAA 2000. Proceedings (Lecture Notes in Computer 
Science Vol.1884) p. 315-22 

Editor(s): Stuller, J.; Pokorny, J.; Thalheim, B.; Masunaga, Y. 

Publisher: Springer-Verlag, Berlin, Germany 

Publication Date: 2000 Country of Publication: Germany xiii+396 pp. 
ISBN: 3 540 67977 4 Material Identity Number: XX-2000-02611 

Conference Title: Current Issues in Databases and Information Systems 
Conference Sponsor: INTAX; Hewlett Packard; KOMIX; Smart 4U; DCIT 
Conference Date: 5-9 Sept. 2000 Conference Location: Prague, Czech 
Republic 

Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P); Theoretical (T) ; Experimental (X) 
Abstract : A number of algorithms of clustering spatial data for reducing 
the number of disk seeks required to process spatial queries have been 
developed. One of the algorithms is the scheme of page ordering, which is 
concerned with the order of pages in one-dimensional storage for storing 
two-dimensional spatial data. The space filling curves, especially the 
Hilbert curves, have been so far used to impose an order on all of the 
pages. Page ordering based on the space filling curves, however, does not 
take into account the uneven distribution of spatial objects and the types 
of spatial queries . We develop a cost model to define the page ordering 
problem based on performance measurement and then find out the method 
of page ordering for efficiently processing region queries in static 
databases . The experimental results will show that the newly proposed 
ordering method achieves better clustering than older methods. (15 Refs) 
Subfile: C 

Descriptors : database theory; query processing; software performance 
evaluation; spatial data structures; visual databases 

Identifiers: region query processing; optimal page ordering; spatial 
data clustering; spatial queries ; one-dimensional storage; 
two-dimensional spatial data; space filling curves; Hilbert curves; cost 
model; performance measurement; static databases ; experimental results 

Class Codes: C6160S (Spatial and pictorial databases); C4250 (Database 
theory); C6120 (File organisation) 

Copyright 2001, lEE 



19/5/26 (Item 7 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2003 Institution of Electrical Engineers. All rts. reserv. 
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Title: Matching overlapping objects in 3-D scene to model -base under a 

heuristic search strategy- 
Author (s): Siwei Lu 

Author Affiliation: Dept. of Comput. Sci., Memorial Univ. of 
Newfoundland, St. John's, Nfld., Canada 

Conference Title: Proceedings of the 5th International Conference on 
Image Analysis and Processing. Progress in Image Analysis and Processing 
p. 478-82 

Editor (s): Cantoni, V.; Cordelia, L.P.; Levialdi, S.; di Baja, G.S. 
Publisher: World Scientific, Singapore 

Publication Date: 1990 Country of Publication: Singapore xiv+787 pp. 
ISBN: 981 02 0061 7 

Conference Date: 20-22 Sept. 1989 Conference Location: Positano, Italy 



Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: This paper presents a model-base vision system which is capable 
of establishing a correspondence between an input scene with several 
objects (one maybe partially occluded by the other) and the 3-D model 
descriptions in a database (model-base) . First, the geometric information 
from a range data is transformed into an attributed hypergraph 
representation (AHR) . Then the objects in a scene are recognized by finding 
a feasible graph monomorphism between the object graph and the model graph. 
However, it becomes difficult to perform such matching in a reasonable 
amount of searches when the number of the vertices in the graphs, the 
number of models in database , and the objects in a scene increase. In 
order to achieve the flexibility and the efficiency , a heuristic search 
strategy is introduced for locating and recognizing objects at 
reasonable speed in spite of the indispensable variations. (4 Refs) 

Subfile: C 

Descriptors: computer vision; computerised picture processing 
Identifiers: overlapping objects matching; 3-D scene; model-base; 

heuristic search strategy; vision system; 3-D model descriptions; 

database ; attributed hypergraph representation; feasible graph 

monomorphism; object graph; model graph 
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Author Affiliation: Bell Labs., Holmdel, NJ, USA 
Journal: Proceedings of the IEEE vol.67, no. 12 p. 1624-42 
Publication Date: Dec. 1979 Country of Publication: USA 
CODEN: lEEPAD ISSN: 0018-9219 

Language: English Document Type: Journal Paper (JP) 
Treatment: Theoretical (T) 

Abstract: Presents and analyzes an effective and practical method of 
accomplishing partial- match retrieval on a computer file containing a 
large number of information records. In partial-match retrieval a subset of 
the records in the file is selected and retrieved by specifying a query 
set consisting of a small number of key values; the records selected and 
retrieved are those having a match to all the key values in the query 
set. Partial-match retrieval can be a powerful capability when used in 
information retrieval systems, and a stored file augmented with such a 
capability is equivalent to an associative or content-addressable store. 
The method presented in this paper is based upon the use of superimposed 
codes. A new algorithm is presented for generating superimposed codes 
without the use of a stored code dictionary. The new algorithm executes 
rapidly on a number generator; it allows the generation of binary codes 
having any desired width and weight. (44 Refs) 

Subfile: C 

Descriptors: content-addressable storage; database management systems; 
information retrieval 

Identifiers: superimposed codes; query set; key values; information 
retrieval systems; partial match retrieval; large computer files; content 
addressable store; pseudo random number generator 

Class Codes: C6120 (File organisation); C6160 (Database management 
systems (DBMS) ) 
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Using Inductive Learning to Generate Rules for Semantic Query 



Optimization 

(Research rept ) 

Hsu, C. N. ; Knoblock, C. A. 

University of Southern California, Marina del Rey. Information Sciences 
Inst. 

Corp, Source Codes: 045598002; 407952 

Jun 95 22p 

Languages: English 

Journal Announcement: GE^I9620 

Order this product from NTIS by: phone at 1-800-553-NTIS (U.S. 
customers); (703)605-6000 (other countries); fax at (703)321-8547; and 
email at orders@ntis.fedworld.gov. NTIS is located at 5285 Port Royal Road, 
Springfield, VA, 22161, USA. 

NTIS Prices: PC A03/MF AOl 

Country of Publication: United States 

Contract No.: F30602-91-C-008 1 ; NSF-IRI93-13993 

Semantic query optimization can dramatically speed up database query 
answering by knowledge intensive reformulation. But the problem of how to 
learn the required semantic rules has not been previously solved. This 
report presents a learning approach to solving this problem. In our 
approach, the learning is triggered by user queries . Then the system uses 
an inductive learning algorithm to generate semantic rules . This inductive 
learning algorithm can automatically select useful join paths and 
attributes to construct rules from a database with many relations. The 
learned semantic rules are effective for optimization because they will 
match query patterns and reflect data regularities. Experimental 

results show that this approach learns sufficient rules for optimization 
that produces a substantial cost reduction. 

Descriptors: Optimization; ^Semantics; ^Interrogation; Data bases ; 
Algorithms; Problem solving; Reduction; Costs; User needs; Learning 

Identifiers: Inductive learning; Sqo (Semantic query optimization); 
NTISDODXA 

Section Headings: 62B (Computers, Control, and Information 
Theory — Computer Software); 62GE (Computers, Control, and Information 
Theory--General ) 
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Object based image retrieval through learning from user search patterns 
and profiles 

Storage and retrieval for media databases 2000 : San Jose CA, 26-28 
January 2000 

YAOWU XU; SABER E; MURATTEKALP A 

YEUNG Minerva M, ed; BOON-LOCK YEO, ed; BOUMAN Charles A, ed 
Department of Electrical and Computer Engineering, University of 
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International Society for Optical Engineering, Bellingham WA, United 
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Journal: SPIE proceedings series, 2000, 3972 234-242 
ISBN: 0-8194-3590-2 ISSN: 1017-2653 Availability: INIST-21760; 
354000090077180230 

No. of Refs. : 13 ref . 

Document Type: P (Serial); C (Conference Proceedings) ; A (Analytic) 
Country of Publication: United States 
Language: English 

In this paper, we propose a self-learning content-based image indexing 
and retrieval system. Our system employs a hierarchical content 
representation (consisting of objects and regions) and a hierarchical 
content matching method for effective and efficient image /object 
retrieval. The "learning" behavior is enabled by our proposed hierarchical 
content representation which allows easy storage of combinations of regions 



that have resulted in successful matches to objects of interest as 
determined by user search patterns and profiles. The learning step 
effectively performs an automatic analysis of database images into 
meaningful objects given certain user search patterns and interest 
profiles. The advantages of the proposed hierarchical content 
representation and "learning" schemes are demonstrated on a collection of 
car and face images, where the significant improvements in search and 
retrieval speed are described both theoretically and experimentally. 

English Descriptors: Information retrieval; Indexing; Method; Hierarchic 
relation; Representation; Similarity; Learning; Artificial intelligence; 
Feedback regulation; Automated processing; Image; Image content 

French Descriptors: Recherche information; Indexation; Methode; Relation 
hierarchique; Representation; Similitude; Apprentissage; Intelligence 
artif icielle; Retroaction; Traitement automatise; Image; Contenu image 

Classification Codes: 001A01F03; 205 
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Title: IME: An image management environment with content-based access 

Author: Abate, Andrea F. ; Nappi, Michele; Tortora, Genny; Tucci, Maurizio 

Corporate Source: Univ of Salerno, Salerno, Italy 

Source: Image and Vision Computing v 17 n 13 1999. p 967-980 

Publication Year: 1999 

CODEN: IVCODK ISSN: 0262-8856 

Language: English 

Document Type: JA; {Journal Article) Treatment: T; (Theoretical) 
Journal Announcement: 0001W3 

Abstract: The article describes an experimental visual environment to 
handle digital images by contents. A suitable spatial index is used to 
organize the images in a spatial access structure for efficient storage and 
retrieval. An image is indexed according to both the spatial arrangement of 
its objects and the morphological and geometrical measures of these 
objects. Therefore, in the database population phase a user identifies 
the objects that characterize the visual content of each image by a 
user-friendly interface. In order to let the system retrieve images based 
on the presence of given patterns, it is necessary to define similarity 
matching criteria between a query and an image . To efficiently 
perform such a match , each image is stored together with a collection 
of metadata that are a very compact representation of the visual contents 
of the image. These metadata form the index of the image. The system 
implements a Spatial Access Method based on k-d-trees to achieve a 
significant speedup over sequential search. We prove the effectiveness and 
the efficiency of the system by performing standard tests on a database 
containing a large number of medical images, namely lung CT scans. (Author 
abstract) 36 Refs. 

Descriptors: Image analysis; Distributed database systems; Data 
acquisition; Indexing (of information) ; Mathematical morphology; 
Computational geometry; Interfaces (computer); Query languages; Data 
structures 

Identifiers: Image management; Content based retrieval; Spatial index; 
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723.2 (Data Processing); 723.3 (Database Systems); 903.1 (Information 
Sources & Analysis); 723.5 (Computer Applications) 

741 (Optics & Optical Devices); 723 (Computer Software); 903 
(Information Science); 921 (Applied Mathematics) 
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Title: Feature identification as an aid to content-based image retrieval 

Author: Ramesh, Nagarajan; Sethi, Ishwar K. 
Corporate Source: Wayne State Univ., Detroit, MI, USA 

Conference Title: Storage and Retrieval for Image and Video Databases III 
Conference Location: San Jose, CA, USA 

Sponsor: SPIE - Int Soc for Opt Engineering, Bellingham, WA USA 
E.I. Conference No. : 22192 

Source: Proceedings of SPIE - The International Society for Optical 
Engineering v 2420 1995. Society of Photo-Optical Instrumentation 
Engineers, Bellingham, WA, USA. p 2-11 

Publication Year: 1995 
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Language: English 

Document Type: CA; (Conference Article) Treatment: T; (Theoretical); A; 
(Applications) 

Journal Announcement: 9609W3 



Abstract: In recent years, databases have evolved from storing pure 
textual information to storing multimedia information - text, audio, video, 
and images. With such databases comes the need for a richer set of search 
keys that include keywords, shapes, sounds, examples, sketches, color, 
texture and motion. In this paper we address the problem of image retrieval 
where keys are object shapes or user sketches. In our scheme, shape 
features are extracted from each image as it is stored. The image is first 
segmented and points of high curvature are extracted. Regions surrounding 
the points of high curvature are used to compute feature values by 
comparing the regions with a number of references. The references 
themselves are picked out from the set of orthonormal wavelet basis 
vectors. An ordered set of distance measures between each local region and 
the wavelet references form a feature vector. When a user queries the 
database through a sketch, the feature vectors for high curvature points 
on the sketch are determined. An efficient nearest neighbor search then 
yields a set of images which contain objects that match the user's 
sketch closely. The process is completely automated. Initial experimental 
results are presented. 10 Ref s . 

Descriptors: Information retrieval systems; Image processing; Vectors; 
Database systems; Query languages; Textures 

Identifiers: Feature identification; Content-based image retrieval; 
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Classification Codes: 
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Conference Title: Proceedings of the 1995 IEEE International Conference 
on Image Processing. Part 1 (of 3) 

Conference Location: Washington, DC, USA Conference Date: 
19951023-19951026 

Sponsor: IEEE 

E.I. Conference No.: 44184 

Source: IEEE International Conference on Image Processing v 1 1996. IEEE, 
Los Alamitos, CA, USA, 95CB358 19 . p 342-345 
Publication Year: 1996 
CODEN: 85QTAW 
Language : English 

Document Type: CA; (Conference Article) Treatment: T; (Theoretical) 
Journal Announcement: 9603W3 

Abstract: A new approach to retrieving images from a color image 
database is proposed in this paper. Each image in the database is 
represented by a pseudo two-dimensional hidden Markov model (2D PHMM) , 
where both the chromatic and spatial information about the image could be 
adequately involved. In addition, a pictorial querying method is used, by 
which the users can paint only rough contents of the desired images if 
their information concerning the desired images is not crisp. The use of 
the flexible querying mechanism and 2D PHMMs eliminates the drawbacks of 
utilizing textual descriptions to retrieve images . Furthermore, 
effective statistical matching between the query picture and each 
image in the database can be achieved. The probability of the query 
picture generated by each 2D PHMM is just considered as the matching score 
between the query and the corresponding image. The images which best 
satisfy the query can thus be retrieved. Promising experimental results 
show the feasibility and effectiveness of the proposed approach. (Author 
abstract) 7 Refs. 



Descriptors: Color image processing; Information retrieval; Mathematical 
models; Markov processes; Image reconstruction; Database systems; 
Statistical methods; Image segmentation; Probability; Indexing (of 
information) 
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querying method; Statistical matching; Two dimensional; Image database 
systems 
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Title: Demonstration of dissemination, storage, and retrieval of Defense 
Mapping Agency digital products over a distributed enterprise network 

Author: Mehring, J.W. 

Corporate Source: Hughes Information Technology Corp., Reston, VA, USA 
Conference Title: 23rd AIPR Workshop: Image and Information Systems: 
Applications and Opportunities 

Conference Location: Washington, DC, USA Conference Date: 19941012 
Sponsor: SPIE - Int Soc for Opt Engineering, Bellingham, WA USA 
E.I. Conference No.: 22173 

Source: Proceedings of SPIE - The International Society for Optical 
Engineering v 2368 1995. Society of Photo-Optical Instrumentation 
Engineers, Bellingham, WA, USA. p 173-182 

Publication Year: 1995 

CODEN: PSISDG ISSN: 0277-786X ISBN: 0-8194-1710-6 
Language: English 

Document Type: CA; (Conference Article) Treatment: A; (Applications); X 
; (Experimental) 

Journal Announcement: 950BW1 

Abstract: As the Defense Mapping Agency moves from a producer of hardcopy 
products to a data warehouse of geospatial products providing the user with 
the most current information accessible on-line, the architecture will 
migrate to a distributed set of massive databases connected via high 
speed local area and wide area networks and accessible by remote users to 
efficiently query , locate , and move the data of interest to them. A 
demonstration of a prototype system that incorporates some of the 
technologies that will be key to the development of the DMA future 
architecture was run in July of 1994. A remote client with a one meter very 
small aperture antenna (VSAT) was used to remotely access, via commercial 
satellite link, the data warehouse consisting of a nationwide set of 
distributed servers connected via asynchronous transfer mode (ATM) 
commercial communications links. The demonstration scenario simulated a 
?take and update' situation where a user has been deployed with geospatial 
data on CD-ROM and is able to access and download updates to the region of 
interest via satellite link. The user is also able to provide update 
information via upload to the central location and is able to collaborate 
with operators at the central location as to the details of the input from 
the remote site,. 2 Refs. 

Descriptors: Geographic information systems; Mapping; Image transmission; 
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transfer mode 

Identifiers: Distributed enterprise network; Defense Mapping Agency 
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723.2 (Data Processing); 722.1 (Data Storage, Equipment & Techniques); 



722.3 (Data Communication, Equipment & Techniques) 
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query evaluation 
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Conference Title: Proceedings of the 18th Annual International Computer 
Software & Applications Conference (COMPSAC 94) 

Conference Location: Taipei, Taiwan Conference Date: 19941109-19941111 
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E.I. Conference No.: 42509 

Source: Proceedings - IEEE Computer Society's International Computer 
Software & Applications Conference 1994. IEEE, Los Alamitos, CA, 
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Publication Year: 1994 

CODEN: PSICD2 ISSN: 0730-6512 

Language: English 

Document Type: CA; (Conference Article) Treatment: A; (Applications); T 
; (Theoretical) 

Journal Announcement: 9504W4 

Abstract: An object-oriented representation of databases which takes 
advantage of the structure of logical representation and an object-oriented 
algorithm that evaluates conjunctive queries in a massively parallel 
environment are reported and analyzed. The object-oriented representation 
is used to implement data level parallelism as data is encapsulated as an 
object. By distributing these objects in a database , parallel evaluation 
is achieved by a global pattern matching approach which is different from 
the conventional tuple-by-tuple, operation-by-operation evaluation 
strategy. This algorithm overcomes the data dependency imposed by shared 
variables which is a major problem in achieving AND-parallelism . Simulation 
results for 2 way and 3 way joins are analysed according to the effect of 
database size and join size. Results tend to show that the scheme proposed 
performs better for queries with large number of AND conjuncts. (Author 
abstract) 5 Refs. 
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(REPRINT); NIPPON TELEGRAPH & TEL PUBL CORP, BASIC RES 

LABS/ATSUGI/KANAGAWA 24301/JAPAN/ 
Journal: MACHINE VISION AND APPLICATIONS, 1998, Vll, Nl, P7-15 
ISSN: 0932-8092 Publication date: 19980000 

Publisher: SPRINGER VERLAG, 175 FIFTH AVE, NEW YORK, NY 10010 
Language: English Document Type: ARTICLE 
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Subfile: CC ENGI — Current Contents, Engineering, Computing & Technology 
Journal Subject Category: ENGINEERING, ELECTRICAL & ELECTRONIC; COMPUTER 

SCIENCE, ARTIFICIAL INTELLIGENCE; COMPUTER SCIENCE, CYBERNETICS 
Abstract: We present an efficient and accurate method for retrieving images 

based on color similarity with a given query image or histogram. The 

method matches the query against parts of the image using histogram 

intersection. Efficient searching for the best matching subimage is 

done by pruning the set of subimages using upper bound estimates. The 

method is fast, has high precision and recall and also allows queries 

based on the positions of one or more objects in the database image. 

Experimental results showing the efficiency of the proposed search 

method, and high precision and recall of retrieval are presented. 
Descriptors--Author Keywords: image retrieval ; color matching ; 

efficient search ; upper bound pruning ; precision and recall 
Identifiers--KeyWord Plus(R): EXPECTED COMPUTATIONAL COST; COLOR; REDUCE 
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Visual image retrieval by elastic matching of user sketches 

(Visuelle Bildwiedergewinnung auf der Basis der elastischen Anpassung . von 
Benutzerhandskizzen) 
Del Bimbo, A; Pala, P 

Dipartimento di Sistemi e Inf., Firenze Univ., Italy 

IEEE Transactions on Pattern Analysis and Machine Intelligence, vl9, n2, 
ppl21-132, 1997 

Document type: journal article Language: English 
Record type: Abstract 
ISSN: 0162-8828 

ABSTRACT: 

Effective image retrieval by content from database requires that visual 
image properties are used instead of textual labels to properly index and 
recover pictorial data. Retrieval by shape similarity, given a 
user-sketched template is particularly challenging, owing to the difficulty 
to derive a similarity measure that closely conforms to the common 
perception of similarity by humans. In this paper, we present a technique 
which is based on elastic matching of sketched templates over the shapes in 
the images to evaluate similarity ranks. The degree of matching achieved 
and the elastic deformation energy spent by the sketch to achieve such a 



match are used to derive a measure of similarity between the sketch and the 
images in the database and to rank images to be displayed. The elastic 
matching is integrated with arrangements to provide scale invariance and 
take into account spatial relationships between objects in multi-object 
queries. Examples from a prototype system are expounded with considerations 
about the effectiveness of the approach and comparative performance 
analysis . 
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index.) (Software Review) (Evaluation) 

Archer, Dick 
WinMag.com, NA 
Sept 27, 2000 

DOCUMENT TYPE: Evaluation LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 533 LINE COUNT: 00042 

TEXT: 

...to see larger image) Selecting document sources and types PC Data 
Finder is a unique search engine for locating misplaced files on your 
hard drives or removable media. By creating an... 

...in the selected directories, PC Data Finder is able to quickly locate 
files matching your search criteria. Once you have created an initial 
index, you can update it with the click. . . 

...less than 3 minutes. It encountered 74,905 unique words, which were 
added to a database . ReadMe.txt and ReadMe.doc were two of the files 
added to my hard drive. . . 

...format, the other in Word format. Examining these files, I spotted the 
following sentence: "PC Data Finder is a PC search engine enabling 
the user to perform full Boolean, proximity, phrase , and wildcard 
searches on documents in any directory or drive on your PC". The sentence 
contains the words... 

...as my test case, (click to see larger image) Results of the * engine 
NEAR7 boolean' search criteria For my first search , I tried just the 
word "engine" and PC Data Finder found 10 6 matches, including my 2 ReadMe 
files. Next I searched for "boolean" and turned up 77 matches, which 
again included my 2 files. For my third search , I entered "engine NEAR5 
boolean" which instructed PC Data Finder to locate files with the... 

...than 5 other words (remember that our files have 6 words in between). 
The ,NEAR5 search resulted in no matches. Finally, I entered "engine NEAR7 
boolean" and got 3 hits in... 

. . .was a Crystal Reports support file named techref.pdf. PC Data Finder had 
discovered my search criteria hidden away on page 434 of a 1,138 page PDF 
file! At $99. . . 
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Two fast pattern-matching algorithms. 

Otto, Erick 

C Users Journal, vl2, n2, p38(13) 
Feb, 1994 

ISSN: 0898-9788 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 4 271 LINE COUNT: 0032 9 

This article introduces two algorithms that are very fast in finding 
strings in large text files . I have implemented them to perform string 

searching in a text database of over 120 Megabytes. I needed 
algorithms that were very much used — one-by-one... 
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01663831 SUPPLIER NUMBER: 15003730 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

USA Flex 486 D40 Performer. (Hardware Review) (one of five evaluations of 

non-Intel 486-based microcomputers in 'Dare to be Different') 

(Evaluation) 

Lockwood, Russ 

Windows Sources, v2, n2, p202{l) 
Feb, 1994 

DOCUMENT TYPE: Evaluation ISSN: 1065-9641 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 8960 LINE COUNT: 00696 

... dBASE IV and ODBC in 1-2-3 's list box when you select Tools/ 

Database /Connect to External. Selecting ODBC brings up the standard ODBC 
Data Sources screen. After establishing the connection to the external 
database , you can select New Query , the results of which will come 
directly into your current spreadsheet. Additional Database menu items 
let you find , add, or delete records ; perform crosstabs; send SQL 
strings directly to your back end; and even create new tables on the back 
end (assuming. . . 



12/3, K/4 (Item 4 from file: 275) 
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dBASE IV 1.5: Borland's first upgrade keeps a classic alive - for now. 

(Software Review) (Borland International Inc. 's data base management 

system) (First Looks) (Evaluation) 

Ricciardi, Sal 

PC Magazine, vll, n9, p46(l) 

May 12, 1992 

DOCUMENT TYPE: Evaluation ISSN: 0888-8507 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 1016 LINE COUNT: 00077 

The new SET KEY TO command implements a smart key filter that lets 
you filter data based on an index expression. This is faster than the 
previous methods because it requires and uses an existing index to find 
the matching records . If your search expression includes an index 
expression , you'll get improved performance by using the SET KEY TO 
command. 

Several new functions provide information about indexes. The... 
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01240507 SUPPLIER NUMBER: 06302436 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Goldatabase. (Software Review ) (one of nine non -programmable relational 

database management system evaluations in 'Relational Databases: Taking 

the Middle Ground*) (evaluation) 

White, T.J. 

PC Magazine, v7, n8, pl72(2) 
April 26, 1988 

DOCUMENT TYPE: evaluation ISSN: 0888-8507 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 907 LINE COUNT: 00070 

... a menu choice, the help function walks you through the options 

available for that command. 

Searching for information in the database can be performed in two 
ways. For novices, Goldatabase provides a query -by-forms option, much 
like that of PFS: File . For experienced users, a Find command can be 



entered on the cominand line. Either search method can perform 
equal-to, less-than and greater-than queries as well as substring 
searches using the Contains option. 

Compound searches using the Find command are performed in multiple 
steps . . . 
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DOS FIND Sieves. 

Pakstis, A.J.; Somerson, P. 

PC Magazine, v4, n8, p247-248 

April 16, 1985 

DOCUMENT TYPE: column ISSN: 0888-8507 LANGUAGE: ENGLISH 

RECORD TYPE: ABSTRACT 

ABSTRACT: Conditional search operations can be implemented by nesting 
operations of the DOS FIND filter. A short batch file is presented 
which performs this procedure. The method is so efficient that for 
some users it may reduce the need for a data base management program. 
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03914100 Supplier Number: 50125720 (USE FORMAT 7 FOR FULLTEXT) 
-IFIS: Food & Nutrition Internet Index (FNII) now lavinched 

M2 Presswire, pN/A 
July 6, 1998 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 235 

(USE FORMAT 7 FOR FULLTEXT) 
TEXT: 

...the availability of FNII a unique new Internet product designed to 
provide a fast and effective method of finding quality food 
information on the World Wide Web. FNII is a fully searchable web site 
describing and indexing. . . 

...Food Science Food Business Food Manufacture & Service Research & 
Education Information Dissemination Consumer Concerns Legislation Precision 

searching is also provided with basic and advanced search options. 
Keywords can be used with both basic and advanced search options. The 
FNII web address is http://www.fnii.ifis.org. A free guest database is 
also available at the site. An annual subscription to FNII, for one user, 
costs . . . 
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05806137 Supplier Number: 50299139 (USE FORMAT 7 FOR FULLTEXT) 
Demographic Marketing 
Inf ormationWeek, p482 
Sept 14, 1998 

Language: English Record Type: Fulltext 
Article Type: Article 

Document Type: Magazine/ Journal; Tabloid; General Trade 
Word Count: 90 

{USE FORMAT 7 FOR FULLTEXT) 
TEXT: 



strategist, a comprehensive database -analysis tool from iMarket, lets 
marketers perform demographic market analysis, including customer 
profiling, market-penetration. . . 

...reporting, and response analysis. The CD-ROM offers integrated 
demographic data via the Dun & Bradstreet database of 10 million U.S. 
businesses, an integrated match and analysis engine, interactive data 
views with user-friendly interfaces, a library of report templates , 
Preformatted reports that perform common marketing queries , drill-down 
capability that allows analysis by individual companies, and money-saving 
automatic suppression options... 



12/3, K/9 (Item 2 from file: 16) 
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04135788 Supplier Number: 4 6035867 (USE FORMAT 7 FOR FULLTEXT) 
Sharing The Workload 
Inf ormationWeek, pl8 
Jan 1, 1996 

Language: English Record Type: Fulltext 

Document Type: Magazine/ Journal ; Tabloid; General Trade 

Word Count: 1693 

... capabilities in Notes for getting information off the Net. For 

example, they could use Notes* searching and retrieval capabilities and 
agent technology to find and import data from the Web. An 
agent-software that performs a rule -based function on the user's 
behalf-could be built to automatically retrieve information from a 
competitor's Web site and import the data into a Notes database 
containing competitive information. Notes could then distribute the 
information to the company sales force. 
InterNotes . . . 
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How Well Do Physicians Use Electronic Information Retrieval Systems? 
Hersh, W. ; Hickam, D.H. 

JAMA, The Journal of the American Medical Association, 1347(1) 
Oct 21, 1998 

ISSN: 0098-7484 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 6377. LINE COUNT: 00571 

ABSTRACT: Health sciences librarians are generally more effective than 
physicians at retrieving information from electronic medical databases 
. Researchers examined studies assessing the effectiveness of online 
search strategies in obtaining medical articles relevant to the 
user's interest. Using databases such as MEDLINE, Library and Information 
Science (LISA) bibliographies, and other sources, librarians obtained 
articles . . . 

...to their topic of interest, compared to 49% relevance for material 
obtained in physician-initiated searches . Database searches often 
returned no articles, or irrelevant articles, frequently resulting from the 
incorrect use of subheadings when searching by index terms. Keyword and 
free-text searching was generally effective and preferred by end-users. 
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10483264 SUPPLIER NUMBER: 21166081 {USE FORMAT 7 OR 9 FOR FULL TEXT) 



Searching pneiimatics on the web. 

Chadwick, B. 

Hydraulics & Pneumatics, v51, n9, p63(3) 
Sept, 1998 

ISSN: 0018-814X LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 2168 LINE COUNT: 00175 

... a guide. But don't stop there, keep on surfing. 

RELATED ARTICLE: A formula for success 
Here's a method I have found to be very effective in finding 
information on pneumatic components. If you know the company name, and the 
product is definitely pneumatic, start with Pneumatics Online. Because it 
contains information only on pneumatics, its database is relatively 
small, so you can quickly find what you're looking for. If it... 

... I suggest going to Fluid Power Web, which contains more information and 
has a larger database on pneumatic as well as hydraulic technology. My 
next choice would be PDEM.net or Manufacturing Marketplace. Databases at 
these sites encompass much more than fluid power, so searching can be 
more cumbersome. If I still don't find the pneumatic information I am 
looking for, I go to the Thomas Register and, lastly, to my favorite 
search engines . 

If, by chance, you need a product but don't know the name of... 
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Online databases: information available electronically. 

Hartman, Stephen W.; Qureshi, Anique; Siegel, Joel G. 
CPA Journal, v67, n4, p46{7) 
April, 1997 

ISSN: 0732-8435 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 2513 LINE COUNT: 00213 

Presentation Methodology 

One way to classify databases is by presentation methodology. The 
methodologies include text, number, image (video), audio, electronic 
services, and software. The earliest public databases were text based, 
and they still are the predominant type of database . As of 1994, text 
based databases represented over 70% of all databases and are 
continuing to grow in number. The user performs searches using text 
phrases to find specific information . Text based databases include 
bibliographic, directory, dictionary, full text, and other. 

Numeric databases are used primarily for transactions... 



12/3, K/13 (Item 4 from file: 148) 
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08388693 SUPPLIER NUMBER: 17968320 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Sharing the workload. (Lotus* Notes 4.0 groupware) (The New High-Tech 
Landscape) (Product Information) 

Stahl, Stephanie 
InformationWeek, n560, pl8(4) 
Jan 1, 1996 

ISSN: 8750-6874 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 17 91 LINE COUNT: 0014 5 

... capabilities in Notes for getting information off the Net. For 

example, they could use Notes' searching and retrieval capabilities and 
agent technology to find and import data from the Web. An 
agent-software that performs a rule -based function on the user's 
behalf-could be built to automatically retrieve information from a 
competitor's Web site and import the data into a Notes database 



containing competitive information. Notes could then distribute the 
information to the company sales force. 
InterNotes . . . 
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How do patrons search the online catalog when no one's looking? Transaction 

log analysis and implications for bibliographic instruction and system 

design. 

Wallace, Patricia M. 
RQ, v33, n2, p239(14) 
Winter, 1993 

ISSN: 0033-7072 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 8252 LINE COUNT: 00658 

... to such devices as a spellchecker and an online thesaurus. 

When to teach end-user searching for maximum benefit has been 
studied extensively. By now, most libraries are offering some general 
training sessions on searching their local OPACs, and some are able to 
offer sessions specifically focused for particular groups... 

...also is available. Barbuto and Cevallos reached several conclusions from 
their study of end-user searching , which should not surprise most 
bibliographic instructors. Hands-on instruction is the vastly preferred 
method of learning to search online, and most searchers will opt for 
this point-of-use-type instruction. (29) It simply is not possible to 
require end users to attend training sessions prior to searching . They 
are far more ...when there is the immediate need to know. Once having used 
the information about the search process, they are more likely to 
remember it and to apply it to their next point-of -use . One area of 
bibliographic instruction that would clearly benefit searchers in advance 
is the most appropriate use of the various catalogs and databases 
available online. In this study, searchers frequently switched to others 
beyond the home catalog. Some of the switches obviously were inappropriate, 
such as searches in the UnCover database for articles by authors long 
deceased. Receiving zero results on a search in the wrong type of 
database often discourages the searcher from pursuing that or other 
options elsewhere. Users should be encouraged to approach any search 
session with the assumption that they will find some information and 
that what is required for success is a process or plan of action. 
Confidence in the outcome of the search is essential, and good 
instruction builds toward that psychological edge. Further and continuing 
collaboration must. . . 



12/3, K/15 (Item 6 from file: 148) 
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(c)2003 The Gale Group. All rts. reserv. 

04591365 SUPPLIER NUMBER: 09048705 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Increase productivity through networking, (accounting systems) 

Yallapragada, RamMohan R. ; Bonvillain, Thomas M. 
National Public Accountant, v35, n5, p28(4) 
May, 1990 

ISSN: 0027-9978 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT 

WORD COUNT: 2570 LINE COUNT: 00204 

. . . productivity: Increased productivity results from developing 

techniques to use the network more efficiently. Because the file server 
has a large disk, finding a file can be time consuming. An efficient 
method to handle this problem is to create a database of file names, 
their descriptions and a code field which indicates the purpose of that 
file. By making the code field a key, the database can be searched for 
by purpose. 



It may be a good idea to document the codes so that. 
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DIALOG (R) File 15 : ABI /Inform (R) 
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The KDD process for extracting useful knowledge from volumes of data 

Fayyad, Usama; Piatetsky-Shapiro, Gregory; Smyth, Padhraic 
Communications of the ACM v39nll PP: 27-34 Nov 1996 
ISSN: 0001-0782 JRNL CODE: ACM 
WORD COUNT: 4 773 

...TEXT: research and application challenges for KDD [4, 5] include: 

Massive datasets and high dimensionality. Multigigabyte databases with 
millions of records and large numbers of fields (attributes and variables) 
are commonplace. These datasets create combinatorially explosive search 
spaces for model induction and increase the chances that a data mining 
algorithm will find spurious patterns that are not generally valid. 
Possible solutions include very efficient algorithms, sampling, 
approximation methods , massively parallel processing, dimensionality 
reduction techniques, and incorporation of prior knowledge. 

User interaction and prior... 
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01063307 97-12701 
The KI companion 

Bjorner, Susanne 

Link-Up vl2n4 PP: 3, 14+ Jul/Aug 1995 
ISSN: 0739-988X JRNL CODE: LUP 
WORD COUNT: 1618 

TEXT: Choosing databases 

Database selection is half the battle in doing a good search . Maybe 
more than half. If you have a question on computer software, for example, 
and there are more than a few databases that touch the topic, you need 
help determining which database to try first. Steve Mcintosh's Kwik 
Knowledge software enables you to scan several competing databases and 
plan an efficient route This month's Searcher 's Query is, "How do I 
find information on the Adobe Acrobat software?" We're looking for 

reviews of this and competing products... 

... on the subject category Computers & Electronics. That single click 
brings up a list of nine databases : Business Software Database , Buyers 
Guide to Micro Software, Computer Database , Compute News Fulltext, Ei 
Compendex Plus, INSPEC, MathSci, Microcomputer Index, and Microcomputer 
Software Guide. 

How. . . 
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An Analytic Measure Predicting Information Retrieval System Performance 

Losee, Robert M. 

Information Processing & Management v27nl PP: 1-13 1991 
ISSN: 0306-4573 JRNL CODE: IPM 



...ABSTRACT: hypertext linkage, and text filtering systems can be measured 
by using historical data or by estimating performance using Bayesian 
probabilistic or artificial intelligence methods . The measurement of 

performance is necessary to evaluate dociament retrieval systems, 
electronic mail filters, office information systems, and, in general, 

retrieval from databases when the searcher has incomplete information 
about the characteristics of the records to be retrieved. A method is 
provided for estimating precision or retrieval quality without examining 
individual database documents. This method requires knowledge of only the 

query or expressed information need and a set of database parameters 
constant for all queries . The analytic expected precision measure is used 
to examine the performance of a system using. ... 
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Using the Micro-Computer to Simplify Database Access: Designing Interfaces 
to Complex Files 

Pratt, G. E. C. 

Journal of Information Science Principles & Practice vlOn3 PP: 131-138 
1985 

ISSN: 0165-5515 JRNL CODE: JIC 

ABSTRACT: The use of the term ''complex'' with regard to an online 
database in reality often suggests that it is considered difficult to 
extract from the database the required data. Just as the online thesaurus 
has removed the ''complexity'' from many bibliographic... 

... by providing a simplified infrastructure, it is equally possible to 
simplify several other types of database searching . The following 
questions are addressed: 1. Can programs automatically generate search 
strategies ? 2. Can programs perform further analysis of the 
retrieved subset of records from a database ? 3. Can these methods 
apply to the special case of graphical representation of data? An 
inexpensive searching tool to make database more accessible is 
proposed. The method involves the integration of available and well-known 
searching techniques . ... 
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01172475 CMP ACCESSION NUMBER: IWK19980914S0125 
Demographic Marketing (What's Hot) 
INFORMATIONWEEK, 1998, n 700, PG482 
PUBLICATION DATE: 980914 

JOURNAL CODE: IWK LANGUAGE: English 

RECORD TYPE: Fulltext 

SECTION HEADING: Behind The News 

WORD COUNT: 90 

TEXT: 

Strategist, a comprehensive database -analysis tool from iMarket, 
lets marketers perform demographic market analysis, including customer 
profiling, market-penetration. . . 

...reporting, and response analysis. The CD-ROM offers integrated 
demographic data via the Dun & Bradstreet database of 10 million U.S. 
businesses, an integrated match and analysis engine, interactive data 

views with user-friendly interfaces, a library of report templates , 
Preformatted reports that perform common marketing queries , drill-down 
capability that allows analysis by individual companies, and money-saving 
automatic suppression options... 
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Sharing The Workload - Notes put Lotus Development ahead of the field in 

groupware. Will Notes 4 keep Lotus in front? (Lotus Notes) 
Stephanie Stahl 

INFORMATIONWEEK, 1996, n 560, PG18 
PUBLICATION DATE: 960101 
JOURNAL CODE: IWK LANGUAGE: English 

RECORD TYPE: Fulltext 

SECTION HEADING: Hot Trends For '96 - Groupware 
WORD COUNT: 168 9 

... capabilities in Notes for getting information off the Net. For 

example, they could use Notes' searching and retrieval capabilities and 
agent technology to find and import data from the Web. An 
agent-software that performs a rule -based function on the user's 
behalf-could be built to automatically retrieve information from a 
competitor's Web site and import the data into a Notes database 
containing competitive information. Notes could then distribute the 
information to the company sales force. 
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Voice Under IP: A New Approach to Convergence. (extensible service policy 
architecture) (Technology Information) 

Graves, Richard C. 
Telecoirununications, 33, 10, 50 
Oct, 1999 

ISSN: 0278-4831 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 1904 LINE COUNT: 00160 

... not need detailed technical knowledge. 

One functional element of a service portal is a call policy server, 
which performs the search and interpretation function, locating the 
controlling Web page and translating its content into switch functions. 
In a world where carriers might have thousands... 

...confers powerful flexibility without the complexity of rigidly 
organizing all customer data into a relational database , The 
interdependency between customer applications is broken, Just as a 
nonfunctional Web page does not... 
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Data Profiling. (Seven products from five companies) (Product Announcement) 

Olson, Jack 

Enterprise Systems Journal, 14, 5, 34{1) 
May, 1999 

DOCUMENT TYPE: Product Announcement ISSN: 1053-6566 LANGUAGE: 

English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 1908 LINE COUNT: 00158 

the data scrubbing mechanisms needed and assessing the time and 
cost of scrubbing. 

The target database design should be based on the information 
gathered during profiling. This gives the project team... 

...where the target is already set, as in moving to an ERP-packaged 
application, the profile will be indispensable in performing the 
matching process between the data available and the data requirements of 
the target. 

Each level of profiling contributes information essential... 
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Internationalize your Java applications: create software packages that 

display information according to the customs of local users . (Technology 

Tutorial) 

Valesky, Tom 

e-Business Advisor, vl6, n9, p50(4) 
Sept, 1998 

LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 24 99 LINE COUNT: 00224 

this point) . 

If a large number of strings are compared, (for example, if you're 
searching or sorting a long list of strings ) , a more efficient way of 
comparing strings is to use a Collator object to find the 



CollationKey of each string and then compare CollationKeys using the 
CollationKey . compare ( ) method. As . . . 
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Letters. (Letter to the Editor) 

Seybold Report on Internet Publishing, vl, n4, p2(3) 
Dec, 1996 

DOCUMENT TYPE: Letter to the Editor LANGUAGE: English 

RECORD TYPE: Fulltext 

WORD COUNT: 1593 LINE COUNT: 00137 

Our concept at Extelligent of creating structured NetEditions via 
"data matching" as opposed to "data searching " is fundamental to the 
design and performance of Profile Publisher. 

Data matching is a much more thorough and accurate method of 
assembling information into structured editions of... 

...inaccuracies of attempting to assemble a similar edition with the same 
level of integrity using search engine technologies, regardless of 
whether or not subscriber profiles are a part of the initial search 
process . 

The typical business publisher, and their business subscribers, can 
hardly be well served with. . . 
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Applied Physics Letters Online: a case study in online journal publishing, 
(includes related articles on the American Institute of Physics* 
editorial and production operations, on the service's design, on other 
online activities of the institute, and on other journals on the Web) 
(Company Business and Marketing) (Cover Story) 
Walter, Mark 

Seybold Report on Publishing Systems, v25, n8, pl2(10) 
Dec 31, 1995 

DOCUMENT TYPE: Cover Story ISSN: 0736-7260 LANGUAGE: English 

RECORD TYPE: Fulltext 

WORD COUNT: 7633 LINE COUNT: 00593 

... issue cover to cover, yet don't want to miss important developments 

in their specialties. 

* Searching . Indexing is a labor-intensive process. Although 
computers still lack the expertise of human indexers in indexing concepts, 
they are much faster and more efficient at indexing words and phrases . 
If you want to find articles about a topic that is consistently 
described the same way (e.g., Faraday rotation, particle... 
...seconds, the system returns to you a list of the articles that have met 
your search criteria, and they are immediately ready to be viewed or 
printed. 

* Updates. In the online... 
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Untangling the Internet, (includes list of Usenet News Groups and access 
provider services) 

Courtney, Philip E. 



Enterprise Systems Journal, vlO, n3, pl6(5) 
March, 1995 

ISSN: 1053-6566 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 1975 LINE COUNT: 00194 

... as a novice. Accessing The Internet 

Like any type of project, one of the most effective methods for 
locating information on the Internet is to divide and conquer. With two 
million connected computers, ignorant access and unnecessary searches 
will result in tremendous connect-time charges while consuming resources 
and diminishing response time for. . . 
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OLE revision forms crux of Microsoft data repository. 

Foley, Mary Jo 

PC Week, vl2, nil, pi (2) 

March 20, 1995 
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WORD COUNT: 64 3 LINE COUNT: 0O054 

... the first quarter of next year. 

While building an OLE-based front end for the repository will make 
it easier for Microsoft and third-party developers to store, find , 
modify, and distribute repository objects , Microsoft must improve OLE 
performance for this strategy to work, said a developer familiar with 
Microsoft's plans. 

Attempting to build OLE-enabled. . . 
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Remote possibilities: database access from the remote point of view means 
re-evaluating IS strategies . 

Krivda, Cheryl D. 

MIDRANGE Systems, v7, n20, p34(2) 
Oct 28, 1994 
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WORD COUNT: 14 08 LINE COUNT: 00111 

ABSTEIACT: IS managers should consider system complexity and communications 
concerns when constructing remote database access systems because 
client/server environments can become extremely difficult to manage 
efficiently. When designing. . . 

. . .personnel tend to underestimate remote client knowledge and activity 
volume. Systems can become overloaded with database access requests from 
remote sources. Controlling user access can be accomplished by educating 
users on the most efficient method to find necessary information . 
System managers also must determine how databases will be updated and 
synchronized. IS managers must answer questions such as procedures for 
communications. . . 
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Digging through your data, (overview of four evaluations of four 

text-processing programs) (individual evaluation records searchable under 



"Digging through Your Data") (Software Review) (Network Edition: First 
Looks) (includes related articles on the editor's choice, PC DOCS 2.0, 
document management standards and the products' suitability to task) 

(Evaluation) 

Garris, John 

PC Magazine, vl3, nl9, pNEl(4) 
Nov 8, 1994 

DOCUMENT TYPE: Evaluation ISSN: 0888-8507 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT; ABSTRACT 
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... let users access documents from remote locations. 

Document retrieval: Here we evaluate the package's search tools. 
With document management software, we want to be able to perform both 
profile and full-text searches on documents located on any server on 
the network. We look for Boolean logic to let us combine multiple search 
criteria, and we give extra points for intelligent searching mechanisms, 
such as proximity searches or thesaurus-based searches . 

Ongoing maintenance: Once the DMS is installed, we assess 
administrative issues. First, we look at... 
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Data on demand. (Software Review) (overview of 15 evaluations of 

client/ server front-end databases) (includes related articles on tips for 
using client/server, pros and cons of adopting client/server 
architecture, database servers, middleware, spreadsheet querying tools, 
ODBC architecture, hidden costs of downsizing, glossary of terms. 
Experts ' Pick) (Evaluation) 
Watterson, Karen 

Windows Sources, v2, n2, p205{31) 
Feb, 1994 

DOCUMENT TYPE: Evaluation ISSN: 1065-9641 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT; ABSTRACT 
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select New Query, the results of which will come directly into your 
current spreadsheet. Additional Database menu items let you find , 
add, or delete records ; perform crosstabs; send SQL strings 
directly 

to your back end; and even create new tables on the back end 
(assuming . . . 
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Index structures for Selective Dissemination of Information under the 
boolean model . 

Tak W. Yan; Garcia-Molina , Hector 

ACM Transactions on Database Systems, vl9, n2, p332(33) 
June, 1994 

ISSN: 0362-5915 LANGUAGE: ENGLISH RECORD TYPE: ABSTRACT 

ABSTRACT: A study was conducted on formulating index structures for 
indexing algorithms and profiles that effectively match documents 
against voluminous profiles is discussed. The index structures are 
intended to facilitate Selective Dissemination of Information (SDI), which 



...a service involving the automatic provision of information regarding new 
additions for users who submit queries that are constantly evaluated. The 



study was lifted from part of the ARPA Electronic Library. 
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Retrieval tool incorporates graphics, video. (Imspace Systems Corp.'s Kudo 
Image Browser) (Brief Article) (Product Announcement) 

Computing Canada, vl9, n4 , p48(l) 
Feb 15, 1993 

DOCUMENT TYPE: Product Announcement ISSN: 0319-0161 LANGUAGE: 
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scroll bar, Riffle feature or Find command. 
Visually browsing through a single catalogue is an effective 
method of locating the right image without launching multiple 
applications and searching through image files one by one, the company 
claims . 

Retailing for $295 (U.S.), Kudo... 
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Develop and display your help messages in dBASE IV. (dBASE Developer 
column) (Tutorial) 

Rubel, Malcolm C. 

Data Based Advisor, vll, nl, pll0(6) 
Jan, 1993 
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RECORD TYPE: FULLTEXT; ABSTRACT 
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...ABSTRACT: SEEKS were developed in the help marker program with one or 
two SEEKS on the search string. The first seek finds the record that 
matches the primary helpmarker string while the second SEEK is 
performed only in the case of a menu match and when the MENU. . . 
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Perl: the programmer's toolbox, (an integrated software development tool 
for UNIX) (Cover Story) 

Sharp, Oliver 

Computer Language, v9, nl2, p40{6) 
Dec, 1992 

DOCUMENT TYPE: Cover Story ISSN: 074 9-2839 LANGUAGE: ENGLISH 

RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 2751 LINE COUNT: 00202 

Especially if you are already writing shell scripts that manipulate 
processes, manage large numbets of files , search for complex string 
matches in files , perform .simple networking tasks, or filter text 
files in various ways, you will benefit from Perl. And you will gain even 
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BASIC functions : NewWave ATL offers several functions for writing agent 
tasks. (PC Tips) 

Kehoe, Miles B. 

HP Professional, v5, nil, p70(2) 
Nov, 1991 

ISSN: 0896-145X LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 976 LINE COUNT: 00076 

You can use them to extract substrings from constants, to check 
user input, or to perform any other operation involving string data . 

The FIND function lets you search a string for a particular 
substring starting at a specific byte position. The syntax of. . . 
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Hyperties. (Cognetics Corp. *s Hyperties 2.36) (Software Review) (one of 

eight evaluations of hypertext software products in '3-D reading with the 

hypertext edge ' ) (evaluation) 

Fersko-Weiss, Henry 

PC Magazine, vlO, nlO, p260{2) 

May 28, 1991 
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RECORD TYPE: FULLTEXT; ABSTRACT 
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highlighting an entry. 

You can look for articles that contain specific words by using the 
Search function's AND or OR connectors. You can also search for words 
that follow each other, like George Washington, but you're restricted to 
just two words. This makes searching fast but limited. The commands 
listed on-screen are supplemented by hotkeys for quick maneuvering. Hit 
CtrlT and the program takes you to the first article found in the search . 
Using these search methods , you can quickly and efficiently find 
the information you want . 

Cognetics Corp. has a clearly defined vision of how hypertext 
software should work. . . 
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A faster way to find blobs. (LSI Logic Corp's L64290 feature extraction 
chip contains object contour tracing algorithms) 

Dolan, Simon 

ESD: The Electronic System Design Magazine, vl9, nil, p71{4) 
Nov, 1989 

ISSN: 0893-2565 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 2303 LINE COUNT: 00173 

. , .ABSTRACT: matching uses such information as slope, curvature and 
perimeter to represent objects in a more efficient manner than template 
matching . The data extracted by contour matching is also insensitive 
to rotation, scaling and translation. The L64290's data-dependent algorithm 
unambiguously traces every contour of an input image once by searching 
for non-zero pixels and then iteratively determining neighboring non-zero 
pixels. An image of... 
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Awk: another successful DOS port from UNIX, (Software Review) 

Parker, Tim 

Computer Language, v6, n4, p89(4) 
April, 1989 

ISSN: 074 9-2839 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 2123 LINE COUNT: 00153 

extract information such as totals and transforming data from one 
format to another. 

Awk will search an input file for characters matching a 
specified string and perform some predetermined action when the 
characters are found. This sequence is referred to as a. . . 
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Evidence-based recognition of 3-D objects. 

Jain, Anil K. ; Hoffman, Richard 

IEEE Transactions on Pattern Analysis and Machine Intelligence, vlO, n6, 

p783{20) 

Nov, 1988 

ISSN: 0162-8828 LANGUAGE: ENGLISH RECORD TYPE: ABSTRACT 

...ABSTRACT: but a technique is described for adding rules of evidence for 
an object to the database from training views of the object. Enough 
evidence conditions are generated to enable successive matching of new 
views of the object . The recognition strategy performs successfully 
in polynomial time for both synthetic and real range images in a variety of 
positions . . . 
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Applications - Traffic Managers Train on Computers. 

THE Journal, vll, n8, p86 
May, 1984 

ISSN: 0192-592X LANGUAGE: ENGLISH RECORD TYPE: ABSTRACT 

. . .ABSTRACT: mileage scales, business logistics management, warehousing 
and industrial traffic management. With access to an updated data base , 
students have up-to-date information , and can find the most cost- 
effective shipment methods . Time is then available for analytical work. 
Minimal input will result in screens that show. . . 
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... is staggering, and all of our online services are designed to ease 

the process of searching for and utilizing vital news clips," said Kelly 



B. Schmitt, Vice President of Luce Online. "With the Luce services now 
available on SageMaker's SageNews, users are able to more effectively and 

strategically locate real-time news clips from thousands of leading 
daily and community news papers, magazines and e- zines." 
SageMaker ' s . . . 
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WebSearchExperts.com to be the First to Dominate the New Internet Search 
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Business Wire, pl390 
Oct 21, 1999 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 597 

need for this type of service within his organization. He was 
curious about the most efficient methods of finding information , 
and in the process, became an expert at searching the Internet for the 
most relevant information. Word quickly spread across the enterprise and 
Doug became inundated with requests from others to conduct Internet 
searches . The idea for WSE had surfaced. 

After just one month in service, the company has... 
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JCAHO Compiles Compendixam of Health Plan Performance Measures 

Medical Utilization Management, v25, n8, pN/A 
May 1, 1997 

Language: English Record Type: Fulltext 
Document Type: Newsletter; Trade 
Word Count: 331 

A sample search would include looking up, say, "utilization, 
frequency of procedures" in the index, where page numbers identify all such 
measures. You'd find within these listed pages NCQA common-format 
profiles for the following performance measures: occurrence of 
angioplasty, cardiac catheterization, coronary artery bypass graft, 
cholecystectomy, hysterectomy, laminectomy/discectomy, and. . . 
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March 18, 1991 

Language: English Record Type: Fulltext 
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... At Limited, most credit applications are processed in an instant 

credit environment. A back-end search is then performed using a looser 
method of matching of information to detect application fraud. While 
this does not delay the application process, the finding of. . . 
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compared the cartridge shell to shells recovered at the crime 

scene . 

Now, Drugfire offers an efficient computer-aided matching 
method that uses a computer data file of photographs of spent shell 
casings recovered at crime scenes. A seized weapon can... 

...fired and the distinctive firing pin mark can be automatically compared 
with those in the database to determine if it matches casings from any 
previous unsolved crimes. 

FBI officials hope to. . . 
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Critters, Creatures, Catfish and Worms . 

Ramos, Rebekah 

Technology & Learning, v20, n8, pll 
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. . . curriculum materials, you can access a Discussion Board or go to 

the Teacher Tips Web page . In both locations, you'll find information 
about the most effective methods for assessing student performance 
and how to take advantage of all the resources available on the site. You 
can. . . 

...the data to create a comparative study of migrations from year to year. 
A simple search engine is available as well. 

Overall, Journey North provides a tremendous opportunity for students 

to. . . 
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Data Management Via the Web. (IntraNet Solutions Inc's Intra.doc 3.0 
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... a flexible document management system that could provide customized 

interfaces for each case, full-text search capabilities and automatic 
indexing. "Our goal was to move away from traditional methods of resource 



and information gathering, and provide our clients with a more efficient 
and cost- effective method of disseminating information , " says 
Jenkins. "Our challenge in finding a dociiment management system was 
that we required users in multiple offices to be on the same... 
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...relative Internet information. The systemrecords mouse clicks and 
keyboard operations, and by using its own search mechanisms, is able to 
effectively find relative information .This method saves time and 
effort when compared to the "keyword" entry method of Internet searches . 
The system also responds to agents (management software) , so it is able to 
individualize its searches even if one terminal has several users. The 
service will be offered on an experimental basis to subscribers to NTT's 
Internet search service. 
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... utility to diagnose the problem and treat it in the same place. 

* A directory service database must be scalable. Breaking a single 
database into pieces and distributing them across multiple servers 
improves performance , because the information is strategically 
located closer on the network to the user. 

* A directory service database must be fault-tolerant... 
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... to store unlimited sets of business information with each asset, 

including keywords, text descriptions, dates, strings and numbers. Users 
can perform queries on this information over the Web to locate 
images that meet their criteria. 

Once users locate their desired images, they can download low 



resolution. 
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Page quits suddenly as president of Donohue 
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ABSTRACT : 

...cabinet minister, was a figure of some controversy, Donohue, a Quebecor 
unit, has formed a search committee to find Page 's successor . Page 

plans to announce his future plans, according to a tersely-worded 
statement he left; he was... 
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sophisticated filter applications that screen large volumes of data 
automatically. The better systems employ a search algorithm recommended 
by the New York Clearing House Association, which uses stop descriptors, 
key words, or combinations of words to identify potential OFAC violations. 
Searching with stop descriptors or "tokens" has proven to be the most 
effective method both in identifying true matches and reducing false 
hits . 

Some of these systems can be equipped with private file creators 
that allow banks to. . . 
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ABSTRACT: Doctors can do much to help their patients find performance 
data on health care plans and services. Many health care plans report 
the number of services they provide, such as childhood vaccinations and 
other preventive services. These reports are collected in databases such 
as the HEDIS database . This information can then be used by the public to 
find health care plans that... 

...common users of this information. Doctors and patients were much less 
likely to use these databases . 
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broader search that can retrieve useful hits when the Web search 
engines fail. 

CHOOSE YOUR STRATEGY 

As with any search strategy , effective searching becomes a 
matter of matching the particular information need with , a strategy that 
can answer it. Use these additional strategies as starting points when 
developing your own repertoire of Internet search techniques. 

Communications to the author should be addressed to Greg R. Notess, 
Montana State University. . . 
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most comprehensive library available on roadway work-zone safety. 
Users' can take advantage of the databases and electronic links to find 
information about the following topics : 

* Successful strategies for public education and outreach 
programs . 

* State specifications related to work-zone safety. 

* Training courses... 
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... area. Uffda! is a site made by Minnesotans about Minnesotans. 

Uffda! provides users with two methods of finding information . 
Users can perform "subject" searches by navigating through directories 
to find the topic and Web site they want or they can perform "keyword" 
searches using the new Uffda! search engine. Users can also submit a 
Minnesota site to Uffda! to be included in the... 
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The ten most effective ways to search WorldCat ob FirstSearch: a practical 
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... of patron training include the following: * to introduce patrons to 

the most effective ways of searching the database . This involves 
teaching them to precisely identify and understand their topics, 
instructing them how to deconstruct their topic into meaningful concepts 
for searching , and then how to use the basic Boolean operators to 
formulate an effective search strategy . * to introduce the concept of 

record and fields to locate the field tags in the WorldCat database 
using the h label command. These field tags are then used with examples to 
demonstrate . . . 

...and relevant retrieval. * to train patrons to focus in on relevant items 
using bound-phrase searching for more precise searching e.g., how to 
actively use controlled vocabulary (Library of Congress Subject Headings). 
* to train patrons to limit a search by using the Boolean AND in addition 
to the limit command. This command in the WorldCat database limits by 
year, language, and publication type. * to introduce the concept of 
precision searching using available proximity operators and bound-phrase 
(Wordlist) search capabilities. * to introduce sophisticated ways of 
searching for specific topics; e.g., theses, dissertations, films, videos, 
computer software, maps, etc. 
Essentially, this. . . 
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hospital will be able to incorporate those best practices as 
appropriate . 

SunHealth's perspective comparative database and its clinical 
benchmarking teams will have diagnosis-specific measures to enhance the 
-effectiveness of... 

. . .Without these diagnosis-specific measures, process improvement plans are 
difficult to create, and hospitals will find it difficult to dociament 
the success of care and cost management plans to employers, patients, 
and the community. 

LESSONS LEARNED 

Based on the experience of benchmarking operational... 
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Managing the costs of on-line information. 

Alderson, Pat 
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... the company's information eggs in one basket, encourages 

competition among vendors and stabilizes costs. 

Finding the most efficient method of information access is 
another way to contain on-line costs. The telecommunications field offers 
multiple options... 

...services. Travelers uses the IBM Information Network for connection to 
some of its on-line database providers. The network provides discounts to 
volume customers at fast transmission speeds without the need. . . 
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Moving mainframe applications to LANs, (local area networks) (includes 
related articles on downsizing and file conversion strategies) 
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... to successfully implement a downsized system. 

As data processing moves toward information processing, and as 
information becomes a strategic corporate asset, finding an efficient 

and cost-effective way to make it available to the user assumes higher 
importance. There.,. 

...supported systems. In view of the developments in the technology and 
business marketplace, even large database mainframe applications seem 
destined to give way to networked configurations using sleeker and more 
efficient . . . 
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Agenda version 2.0. (Software Review) (one of six evaluations of PIMs in 
'Taking notes of PIMs*) (Evaluation) 

Marshall, Patrick; Marcus, Ann M. 
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can always create new views or modify those included. 

Views are the primary way to locate and display information . But 
you can also perform a character- string search within any view or 
across an entire database . Agenda lets you specify whether to search 
items alone, the notes attached to items, or both. 

You can use wild cards in... 
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The Los Angeles Times: a special kind of database, (part two: Dow Jones 
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... of hits by using the "Display Hit Scroll" command. 

Highlighting 

Both systems automatically highlight the search word or phrase by 
placing an asterisk in the left margin of the record . We find this an 
especially effective method of locating search tetras and much prefer 
it to highlighting the search terms within the text. This is still 
possible on Dow Jones, however, via the "..Set... 
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. . . The basic idea behind the DowQuest approach is that words in an 

initial typed-in query or in later selected example documents and 
paragraphs are checked very quickly in parallel against... 

...just the 100 most frequent words from each document. Each processor 
basically asks itself, "Which documents do I own that match the 
profile of the query ?" 

Matching is performed by tallying word occurrences, combining 
scores for different words (basically using an implicit Boolean "or" 
between query words) and then normalizing the score to take into account 
the size of documents and length of query . DowQuest then sorts the 
documents by their scores and displays headlines in this order. Documents 
containing more of the query words and more of each particular word will 
be generally higher in significance. Also, documents with phrases (e.g., 
George Bush or nuclear fusion) , where query words occur closer together, 
are ranked more highly than documents in which the query words are 
scattered. 

You could achieve a similar effect of "Completeness with ranking" in 

a . . . 
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ABSTRACT: The on-line search is a way of finding information on 
topics quickly and efficiently . The on-line search methods of DIALOG 
Information Services are discussed, along with a three figure display of 
the search procedure. The importance of identifying clearly the topic to 
be researched is emphasized, and the... 
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Word Count: 987 *Full text available in Formats 5, 7 and 9* 

TEXT: 

...a subject-specific approach; research written by analysts with practical 
experience; customized services; and an efficient method to search , 
find , and retrieve information . Giga believes existing IT advisory 
services do not meet these needs, and that another key. . . 
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Rev. Proc. 98-22, relating to the employee plans compliance resolution 
system 

Anonymous 
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ISSN: 0040-0025 JRNL CODE: TXE 
WORD COUNT: 5200 

...TEXT: to the participant or beneficiary, including applicable overhead 
and amounts paid to outside consultants to locate and review records , 
develop databases , and perform calculations. 

In large, complex plans with a substantial number of years of operation, 
employers may expend millions of dollars in order to retrieve ancient 
records, build computerized data bases , locate all affected 

participants and employees, and attempt to fully and precisely correct all 
failures . . . 
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The costs of confusion in cyberspace 

Oram, Jon H 
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...ABSTRACT: in this case legitimizes and encourages the guessing of domain 
names as a method of searching for web sites at the expense of the other, 
comparatively more efficient , methods of locating sites . The court 
should have equated the practice of guessing domain names with user 
carelessness and. . . 

...similar sounding domain names, which might eliminate the guesswork as an 
effective method of Internet searching . This in turn would promote 
alternative technologies, such as more expansive and better categorized 
directories . . . 
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Ordering author and work records: An evaluation of collocation in online 
catalog displays 

Carlyle, Allyson 

Journal of the American Society for Information Science v47n7 PP: 538-554 

Jul 1996 
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...ABSTRACT: authors and works, a survey compared the displays resulting 
from 5 author and 5 work queries in 18 online catalogs. Dependent 
variables to measure collocation included the number of times irrelevant 
records were interfiled among relevant records. Searches for worst-case 
authors and works associated with large retrieval sets, including "Homer" 
and "Paradise Lost, " revealed the effects of Boolean versus string 
matching, query type, and catalog size on the collocation of relevant 
records. Results of the survey showed that string matching collocated 
relevant records more successfully than Boolean matching , that author 
records were collocated more successfully than work records, and, 
surprisingly, that catalog size had only a... 
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...TEXT: disc using built-in support for CD-Recordable drives. 

The Windows-based Golden Retriever text search and retrieval software 
uses full-text indexing. Searches can be performed using text string 
matching or searching for specific file properties. Integrated 
search modules are royalty-free. Golden Retriever is available in 
single-user ($395) or multiuser versions... 
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...TEXT: text you need. Abbott Systems' RescueTxt is designed to help you 
solve both problems. 

RescueTxt searches all of your hard drives for the presence of any text 
string. Unlike other utilities that search for text, RescueTxt reads the 
disk directly, block by block — so it can actually seek. . . 

... you've already deleted, provided nothing has written over the deleted 
file I had RescueTxt search for text in a number of documents, some 
recently deleted, and it was consistently successful in finding the 
correct search strings . 



Unfortunately, even when you locate the text in a file that isn't 
deleted, RescueTxt doesn't tell you the file's name, so you... 
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...TEXT: be interpreted as such a representation and establishes the 
indicated key bindings. 

Image EMACS implements search by comparing a sequence of reference 

character images (the search string ) to successive character images 
in the buffer until the match score exceeds a preset threshold. The 
search string can be specified by typing, in which case it is constructed 

from the character images that are bound to the keys. Alternatively, the 
search string can be specified using the mouse, either by dragging the 

mouse across a sequence... 
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. . . TEXT : Corporation 

* 72 RETAIL SECURITY STRATEGIC SEARCHING FOR ONLINE INFORMATION SERVICES 

Ms. Guenther will discuss the strategic searching of online 

information to perform background assets, locates , and business due 
diligence . 

Moderator: Kenneth F. Leonard, CPP, Certified Investment & Prevention 
Services 

Speaker: Sharon... 
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...TEXT: every catalog company should be to better understand its 
customers." And one of the most efficient methods of collecting 

customer data is through market research. 

Rosen finds that the larger the catalog company, the more likely it is to 
have a market search program. Of the smaller consumer companies (under $1 
million in sales), only 63% have a... 
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...ABSTRACT: of determining what information is of interest to the employee 
while minimizing the amount of search through irrelevant information. 
Several information-retrieval (IR) methods for filtering TMs were tested. 
The research. . . 

randomly selected TMs, the current distribution method, and an 
organizationally based filtering method. The LSI match - document 
profile method proved to be the most successful of the 4 filter 
methods . 
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...ABSTRACT: at Quaker Oats Co. (< 
system that allows executives to 
a menu-driven command structure 
data frequently. . . 

... so that it can be integrated 
system that locates relevant 
out extraneous data. 



Chicago) , created a proprietary front-end 
search various online databases using 
He also devised a system for collecting 

into internal systems, and 2. creating a 
information efficiently and filters 
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Chinese Online - Problems and Solutions 
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...ABSTRACT: entering characters into the computer to create source files 
as well as entering characters as search keys to retrieve stored 
information. There are distinctive differences between the English and 
Chinese languages. . . 

... order to facilitate Chinese text editing and information retrieval. It 
is suggested that regular expression searching be used to facilitate 
character entry and retrieval. With this method, procedures must be 
designed to control noisy information and to find the regular 
expressions that provide the best performance . 
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... utility to diagnose the problem and treat it in the same place. 

- A directory service database must be scalable. Breaking a single 
database into pieces and distributing them across multiple servers 

improves performance , because the information is strategically 
located closer on the network to the user. 

- A directory service database must be fault-tolerant... 
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Cache panache 

A well -placed caching device can boost Web site performance and cut WAN 
costs . 

Byline: Steven G. Clegg 
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Text: 

. . . lengthen the cache refreshness time to limit the number of times the 
cache has to search for the origin server. Another algorithm can send a 
"get if modified" request to the... 

... copies, storing like information contiguously, in addition to setting 
cluster table . updates. With good price/ performance , hit rate, algorithms 
and a strategy to locate caches, an object saved to cache could be 
money earned. Clegg is founder and principal of Beacon Strategies... 
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